H1.8 top features

Top feature 0 in H1.8: (feature 8449

TOP ACTIVATIONS
MAX = 1.496

.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
crack
Token crack
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
or
Token or
Feature activation+0.000
crack
Token crack
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
ers
Tokeners
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
can
Token can
Feature activation+0.716
do
Token do
Feature activation+0.757
for
Token for
Feature activation+0.658
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
can
Token can
Feature activation+0.716
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
can
Token can
Feature activation+0.716
do
Token do
Feature activation+0.757

Top DFA by src position
MAX = 0.573

bread
Token bread
Feature activation+0.088
Top resid features:
or
Token or
Feature activation+0.039
Top resid features:
crack
Token crack
Feature activation+0.063
Top resid features:
ers
Tokeners
Feature activation+0.062
Top resid features:
.
Token.
Feature activation+0.032
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.421
Top resid features:
7
Token7
Feature activation+0.130
Top resid features:
Ways
Token Ways
Feature activation+0.264
Top resid features:
to
Token to
Feature activation+0.247
Top resid features:
Support
Token Support
Feature activation+0.235
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
bread
Token bread
Feature activation+0.094
Top resid features:
or
Token or
Feature activation+0.037
Top resid features:
crack
Token crack
Feature activation+0.063
Top resid features:
ers
Tokeners
Feature activation+0.064
Top resid features:
.
Token.
Feature activation+0.033
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.322
Top resid features:
7
Token7
Feature activation+0.093
Top resid features:
Ways
Token Ways
Feature activation+0.178
Top resid features:
to
Token to
Feature activation+0.172
Top resid features:
Support
Token Support
Feature activation+0.244
Top resid features:
a
Token a
Feature activation+0.095
Top resid features:
bread
Token bread
Feature activation+0.097
Top resid features:
or
Token or
Feature activation+0.031
Top resid features:
crack
Token crack
Feature activation+0.048
Top resid features:
ers
Tokeners
Feature activation+0.048
Top resid features:
.
Token.
Feature activation+0.024
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.271
Top resid features:
7
Token7
Feature activation+0.071
Top resid features:
Ways
Token Ways
Feature activation+0.089
Top resid features:
to
Token to
Feature activation+0.094
Top resid features:
Support
Token Support
Feature activation+0.165
Top resid features:
a
Token a
Feature activation+0.082
Top resid features:
bread
Token bread
Feature activation+0.086
Top resid features:
or
Token or
Feature activation+0.035
Top resid features:
crack
Token crack
Feature activation+0.057
Top resid features:
ers
Tokeners
Feature activation+0.060
Top resid features:
.
Token.
Feature activation+0.030
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.305
Top resid features:
7
Token7
Feature activation+0.080
Top resid features:
Ways
Token Ways
Feature activation+0.150
Top resid features:
to
Token to
Feature activation+0.134
Top resid features:
Support
Token Support
Feature activation+0.234
Top resid features:
a
Token a
Feature activation+0.093
Top resid features:
bread
Token bread
Feature activation+0.083
Top resid features:
or
Token or
Feature activation+0.043
Top resid features:
crack
Token crack
Feature activation+0.060
Top resid features:
ers
Tokeners
Feature activation+0.067
Top resid features:
.
Token.
Feature activation+0.035
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.389
Top resid features:
7
Token7
Feature activation+0.115
Top resid features:
Ways
Token Ways
Feature activation+0.177
Top resid features:
to
Token to
Feature activation+0.162
Top resid features:
Support
Token Support
Feature activation+0.268
Top resid features:
a
Token a
Feature activation+0.089
Top resid features:
bread
Token bread
Feature activation+0.079
Top resid features:
or
Token or
Feature activation+0.033
Top resid features:
crack
Token crack
Feature activation+0.051
Top resid features:
ers
Tokeners
Feature activation+0.053
Top resid features:
.
Token.
Feature activation+0.025
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.251
Top resid features:
7
Token7
Feature activation+0.081
Top resid features:
Ways
Token Ways
Feature activation+0.107
Top resid features:
to
Token to
Feature activation+0.104
Top resid features:
Support
Token Support
Feature activation+0.144
Top resid features:
a
Token a
Feature activation+0.087
Top resid features:
bread
Token bread
Feature activation+0.088
Top resid features:
or
Token or
Feature activation+0.037
Top resid features:
crack
Token crack
Feature activation+0.050
Top resid features:
ers
Tokeners
Feature activation+0.047
Top resid features:
.
Token.
Feature activation+0.025
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.288
Top resid features:
7
Token7
Feature activation+0.062
Top resid features:
Ways
Token Ways
Feature activation+0.082
Top resid features:
to
Token to
Feature activation+0.077
Top resid features:
Support
Token Support
Feature activation+0.125
Top resid features:
a
Token a
Feature activation+0.083
Top resid features:
bread
Token bread
Feature activation+0.073
Top resid features:
or
Token or
Feature activation+0.039
Top resid features:
crack
Token crack
Feature activation+0.056
Top resid features:
ers
Tokeners
Feature activation+0.055
Top resid features:
.
Token.
Feature activation+0.023
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.294
Top resid features:
7
Token7
Feature activation+0.092
Top resid features:
Ways
Token Ways
Feature activation+0.132
Top resid features:
to
Token to
Feature activation+0.122
Top resid features:
Support
Token Support
Feature activation+0.145
Top resid features:
a
Token a
Feature activation+0.104
Top resid features:
bread
Token bread
Feature activation+0.063
Top resid features:
or
Token or
Feature activation+0.041
Top resid features:
crack
Token crack
Feature activation+0.054
Top resid features:
ers
Tokeners
Feature activation+0.062
Top resid features:
.
Token.
Feature activation+0.041
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.537
Top resid features:
7
Token7
Feature activation+0.163
Top resid features:
Ways
Token Ways
Feature activation+0.204
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
Support
Token Support
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
bread
Token bread
Feature activation+0.048
Top resid features:
or
Token or
Feature activation+0.039
Top resid features:
crack
Token crack
Feature activation+0.054
Top resid features:
ers
Tokeners
Feature activation+0.058
Top resid features:
.
Token.
Feature activation+0.029
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.367
Top resid features:
7
Token7
Feature activation+0.107
Top resid features:
Ways
Token Ways
Feature activation+0.174
Top resid features:
to
Token to
Feature activation+0.216
Top resid features:
Support
Token Support
Feature activation+0.313
Top resid features:
a
Token a
Feature activation+0.065
Top resid features:
bread
Token bread
Feature activation+0.069
Top resid features:
or
Token or
Feature activation+0.040
Top resid features:
crack
Token crack
Feature activation+0.051
Top resid features:
ers
Tokeners
Feature activation+0.057
Top resid features:
.
Token.
Feature activation+0.023
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.286
Top resid features:
7
Token7
Feature activation+0.077
Top resid features:
Ways
Token Ways
Feature activation+0.110
Top resid features:
to
Token to
Feature activation+0.098
Top resid features:
Support
Token Support
Feature activation+0.160
Top resid features:
a
Token a
Feature activation+0.094
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.199
Top resid features:
4
Token4
Feature activation+0.036
Top resid features:
tsp
Token tsp
Feature activation+0.088
Top resid features:
kosher
Token kosher
Feature activation+0.072
Top resid features:
salt
Token salt
Feature activation+0.073
Top resid features:
Ċ
TokenĊ
Feature activation+0.013
Top resid features:
bread
Token bread
Feature activation+0.052
Top resid features:
or
Token or
Feature activation+0.043
Top resid features:
crack
Token crack
Feature activation+0.060
Top resid features:
ers
Tokeners
Feature activation+0.065
Top resid features:
.
Token.
Feature activation+0.036
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.573
Top resid features:
7
Token7
Feature activation+0.047
Top resid features:
Ways
Token Ways
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
Support
Token Support
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
bread
Token bread
Feature activation+0.049
Top resid features:
or
Token or
Feature activation+0.043
Top resid features:
crack
Token crack
Feature activation+0.057
Top resid features:
ers
Tokeners
Feature activation+0.059
Top resid features:
.
Token.
Feature activation+0.024
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.420
Top resid features:
7
Token7
Feature activation+0.130
Top resid features:
Ways
Token Ways
Feature activation+0.322
Top resid features:
to
Token to
Feature activation+0.092
Top resid features:
Support
Token Support
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
bread
Token bread
Feature activation+0.073
Top resid features:
or
Token or
Feature activation+0.027
Top resid features:
crack
Token crack
Feature activation+0.038
Top resid features:
ers
Tokeners
Feature activation+0.042
Top resid features:
.
Token.
Feature activation+0.020
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.209
Top resid features:
7
Token7
Feature activation+0.056
Top resid features:
Ways
Token Ways
Feature activation+0.077
Top resid features:
to
Token to
Feature activation+0.063
Top resid features:
Support
Token Support
Feature activation+0.106
Top resid features:
a
Token a
Feature activation+0.070
Top resid features:
bread
Token bread
Feature activation+0.039
Top resid features:
or
Token or
Feature activation+0.028
Top resid features:
crack
Token crack
Feature activation+0.043
Top resid features:
ers
Tokeners
Feature activation+0.039
Top resid features:
.
Token.
Feature activation+0.021
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.254
Top resid features:
7
Token7
Feature activation+0.062
Top resid features:
Ways
Token Ways
Feature activation+0.090
Top resid features:
to
Token to
Feature activation+0.078
Top resid features:
Support
Token Support
Feature activation+0.112
Top resid features:
a
Token a
Feature activation+0.081
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.194
Top resid features:
4
Token4
Feature activation+0.026
Top resid features:
tsp
Token tsp
Feature activation+0.056
Top resid features:
kosher
Token kosher
Feature activation+0.040
Top resid features:
salt
Token salt
Feature activation+0.057
Top resid features:
Ċ
TokenĊ
Feature activation+0.014
Top resid features:
supportive
Token supportive
Feature activation+0.094
Top resid features:
,
Token,
Feature activation+0.062
Top resid features:
helpful
Token helpful
Feature activation+0.076
Top resid features:
friend
Token friend
Feature activation+0.086
Top resid features:
is
Token is
Feature activation+0.100
Top resid features:
the
Token the
Feature activation+0.244
Top resid features:
best
Token best
Feature activation-0.028
Top resid features:
thing
Token thing
Feature activation+0.000
Top resid features:
you
Token you
Feature activation+0.000
Top resid features:
can
Token can
Feature activation+0.000
Top resid features:
do
Token do
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.204
Top resid features:
4
Token4
Feature activation+0.029
Top resid features:
tsp
Token tsp
Feature activation+0.067
Top resid features:
kosher
Token kosher
Feature activation+0.051
Top resid features:
salt
Token salt
Feature activation+0.057
Top resid features:
Ċ
TokenĊ
Feature activation+0.013
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.193
Top resid features:
4
Token4
Feature activation+0.028
Top resid features:
tsp
Token tsp
Feature activation+0.062
Top resid features:
kosher
Token kosher
Feature activation+0.045
Top resid features:
salt
Token salt
Feature activation+0.053
Top resid features:
Ċ
TokenĊ
Feature activation+0.013
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.08

Head 2: 0.07

Head 3: 0.03

Head 4: 0.08

Head 5: 0.06

Head 6: 0.06

Head 7: 0.10

Head 8: 0.19

Head 9: 0.12

Head 10: 0.07

Head 11: 0.09

Positive logits

�士1.44

Privacy1.27

1.26

ART1.25

Events1.24

OTOS1.21

Gamergate1.20

exhib1.16

��1.15

Breed1.13

announcements1.13

actions1.13

moderators1.12

esports1.12

translations1.11

1.10

Reloaded1.10

demographics1.08

1.07

フォ1.07

Negative logits

icho-1.82

ferment-1.51

oba-1.48

chlor-1.46

washing-1.45

caul-1.41

butter-1.41

worm-1.38

coconut-1.38

icer-1.38

broth-1.35

rice-1.35

shrimp-1.35

fry-1.34

bowl-1.34

hews-1.33

milk-1.33

contam-1.32

oil-1.31

conco-1.31

INTERVAL 1.347 - 1.496
CONTAINS 0.000%

.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703

INTERVAL 1.197 - 1.347
CONTAINS 0.000%

7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976

INTERVAL 1.047 - 1.197
CONTAINS 0.001%

crack
Token crack
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
's
Token's
Feature activation+1.183
Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860

INTERVAL 0.898 - 1.047
CONTAINS 0.000%

or
Token or
Feature activation+0.000
crack
Token crack
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
ers
Tokeners
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
Friend
Token Friend
Feature activation+1.283
Who
Token Who
Feature activation+1.450
's
Token's
Feature activation+1.183
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800

INTERVAL 0.748 - 0.898
CONTAINS 0.001%

Question
Token Question
Feature activation+1.317
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
can
Token can
Feature activation+0.716
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
can
Token can
Feature activation+0.716
do
Token do
Feature activation+0.757
for
Token for
Feature activation+0.658

INTERVAL 0.598 - 0.748
CONTAINS 0.001%

bread
Token bread
Feature activation+0.000
or
Token or
Feature activation+0.000
crack
Token crack
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.662
7
Token7
Feature activation+0.953
Ways
Token Ways
Feature activation+1.177
to
Token to
Feature activation+0.935
Support
Token Support
Feature activation+1.496
a
Token a
Feature activation+1.109
ing
Tokening
Feature activation+1.084
Their
Token Their
Feature activation+1.261
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
Sexual
Token Sexual
Feature activation+1.352
ity
Tokenity
Feature activation+1.187
Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785
thing
Token thing
Feature activation+1.119
you
Token you
Feature activation+0.896
can
Token can
Feature activation+0.716
do
Token do
Feature activation+0.757

INTERVAL 0.449 - 0.598
CONTAINS 0.000%

Ċ
TokenĊ
Feature activation+0.819
Ċ
TokenĊ
Feature activation+0.690
Being
TokenBeing
Feature activation+0.976
a
Token a
Feature activation+0.703
supportive
Token supportive
Feature activation+0.860
,
Token,
Feature activation+0.569
helpful
Token helpful
Feature activation+0.684
friend
Token friend
Feature activation+0.800
is
Token is
Feature activation+0.758
the
Token the
Feature activation+0.720
best
Token best
Feature activation+0.785

INTERVAL 0.299 - 0.449
CONTAINS 0.001%

INTERVAL 0.150 - 0.299
CONTAINS 0.001%

3
Token3
Feature activation+0.000
â̳
Tokenâ̳
Feature activation+0.000
]
Token]
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Join
TokenJoin
Feature activation+0.096
us
Token us
Feature activation+0.162
on
Token on
Feature activation+0.073
Reddit
Token Reddit
Feature activation+0.140
!
Token!
Feature activation+0.076
More
Token More
Feature activation+0.118
about
Token about
Feature activation+0.081
the
Token the
Feature activation+0.000
Republic
Token Republic
Feature activation+0.000
of
Token of
Feature activation+0.000
Ireland
Token Ireland
Feature activation+0.000
will
Token will
Feature activation+0.000
decide
Token decide
Feature activation+0.218
on
Token on
Feature activation+0.000
May
Token May
Feature activation+0.000
22
Token 22
Feature activation+0.000
nd
Tokennd
Feature activation+0.000
on
Token on
Feature activation+0.000
raining
Token raining
Feature activation+0.000
continuously
Token continuously
Feature activation+0.000
here
Token here
Feature activation+0.000
in
Token in
Feature activation+0.000
Santa
Token Santa
Feature activation+0.000
Barbara
Token Barbara
Feature activation+0.190
from
Token from
Feature activation+0.000
past
Token past
Feature activation+0.000
few
Token few
Feature activation+0.000
weeks
Token weeks
Feature activation+0.000
.
Token.
Feature activation+0.000
More
Token More
Feature activation+0.118
about
Token about
Feature activation+0.081
this
Token this
Feature activation+0.071
in
Token in
Feature activation+0.025
the
Token the
Feature activation+0.088
show
Token show
Feature activation+0.158
!
Token!
Feature activation+0.166
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Had
TokenHad
Feature activation+0.050
a
Token a
Feature activation+0.000
about
Token about
Feature activation+0.081
this
Token this
Feature activation+0.071
in
Token in
Feature activation+0.025
the
Token the
Feature activation+0.088
show
Token show
Feature activation+0.158
!
Token!
Feature activation+0.166
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Had
TokenHad
Feature activation+0.050
a
Token a
Feature activation+0.000
special
Token special
Feature activation+0.000

INTERVAL 0.000 - 0.150
CONTAINS 99.996%

landed
Token landed
Feature activation+0.000
them
Token them
Feature activation+0.000
former
Token former
Feature activation+0.000
Knicks
Token Knicks
Feature activation+0.000
shooting
Token shooting
Feature activation+0.000
guard
Token guard
Feature activation+0.000
Tim
Token Tim
Feature activation+0.000
Hard
Token Hard
Feature activation+0.000
away
Tokenaway
Feature activation+0.000
Jr
Token Jr
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
"
Token"
Feature activation+0.000
It
TokenIt
Feature activation+0.000
has
Token has
Feature activation+0.000
the
Token the
Feature activation+0.000
right
Token right
Feature activation+0.000
to
Token to
Feature activation+0.000
do
Token do
Feature activation+0.000
so
Token so
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
strange
Token strange
Feature activation+0.000
phenomenon
Token phenomenon
Feature activation+0.000
to
Token to
Feature activation+0.000
put
Token put
Feature activation+0.000
it
Token it
Feature activation+0.000
mildly
Token mildly
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
United
Token United
Feature activation+0.000
We
TokenWe
Feature activation+0.000
can
Token can
Feature activation+0.000
easily
Token easily
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
em
Tokenem
Feature activation+0.000
ulate
Tokenulate
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
lists
Token lists
Feature activation+0.000
by
Token by
Feature activation+0.000
shipping
Token shipping
Feature activation+0.000
costs
Token costs
Feature activation+0.000
,
Token,
Feature activation+0.000
even
Token even
Feature activation+0.000
for
Token for
Feature activation+0.000
non
Token non
Feature activation+0.000
-
Token-
Feature activation+0.000
U
TokenU
Feature activation+0.000
.
Token.
Feature activation+0.000
S
TokenS
Feature activation+0.000
.
Token.
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 1 in H1.8: (feature 17403

TOP ACTIVATIONS
MAX = 1.387

for
Token for
Feature activation+0.000
20
Token 20
Feature activation+0.000
teams
Token teams
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
teams
Token teams
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546
respectfully
Token respectfully
Feature activation+0.742
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
<|endoftext|>
Token<|endoftext|>
Feature activation+0.293
When
TokenWhen
Feature activation+1.007
you
Token you
Feature activation+0.929
take
Token take
Feature activation+0.862
into
Token into
Feature activation+0.703
account
Token account
Feature activation+1.078
its
Token its
Feature activation+0.820
purchase
Token purchase
Feature activation+0.648
price
Token price
Feature activation+0.605
and
Token and
Feature activation+0.412
all
Token all
Feature activation+0.517
begins
Token begins
Feature activation+0.011
for
Token for
Feature activation+0.000
20
Token 20
Feature activation+0.000
teams
Token teams
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546
respectfully
Token respectfully
Feature activation+0.742
.
Token.
Feature activation+0.535
Ċ
TokenĊ
Feature activation+0.394
<|endoftext|>
Token<|endoftext|>
Feature activation+0.203
L
TokenL
Feature activation+0.705
ONDON
TokenONDON
Feature activation+0.781
âĢĶ
Token âĢĶ
Feature activation+0.884
Energy
Token Energy
Feature activation+0.821
companies
Token companies
Feature activation+1.033
have
Token have
Feature activation+0.956
spent
Token spent
Feature activation+0.875
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
upset
Token upset
Feature activation+0.000
on
Token on
Feature activation+0.000
their
Token their
Feature activation+0.000
minds
Token minds
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.293
When
TokenWhen
Feature activation+1.007
you
Token you
Feature activation+0.929
take
Token take
Feature activation+0.862
into
Token into
Feature activation+0.703
account
Token account
Feature activation+1.078
its
Token its
Feature activation+0.820
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546
L
TokenL
Feature activation+0.705
ONDON
TokenONDON
Feature activation+0.781
âĢĶ
Token âĢĶ
Feature activation+0.884
Energy
Token Energy
Feature activation+0.821
companies
Token companies
Feature activation+1.033
have
Token have
Feature activation+0.956
spent
Token spent
Feature activation+0.875
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
state
Token state
Feature activation+0.885
20
Token 20
Feature activation+0.000
teams
Token teams
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
on
Token on
Feature activation+0.000
their
Token their
Feature activation+0.000
minds
Token minds
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.293
When
TokenWhen
Feature activation+1.007
you
Token you
Feature activation+0.929
take
Token take
Feature activation+0.862
into
Token into
Feature activation+0.703
account
Token account
Feature activation+1.078
its
Token its
Feature activation+0.820
purchase
Token purchase
Feature activation+0.648
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546
respectfully
Token respectfully
Feature activation+0.742
.
Token.
Feature activation+0.535
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
state
Token state
Feature activation+0.885
of
Token of
Feature activation+0.489
strategic
Token strategic
Feature activation+0.898
paralysis
Token paralysis
Feature activation+0.744
,
Token,
Feature activation+0.496
wary
Token wary
Feature activation+0.512
of
Token of
Feature activation+0.355
making
Token making
Feature activation+0.525
have
Token have
Feature activation+0.956
spent
Token spent
Feature activation+0.875
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
state
Token state
Feature activation+0.885
of
Token of
Feature activation+0.489
strategic
Token strategic
Feature activation+0.898
paralysis
Token paralysis
Feature activation+0.744
,
Token,
Feature activation+0.496
wary
Token wary
Feature activation+0.512

Top DFA by src position
MAX = 1.127

planning
Token planning
Feature activation+0.047
Top resid features:
begins
Token begins
Feature activation+0.070
Top resid features:
for
Token for
Feature activation+0.051
Top resid features:
20
Token 20
Feature activation+0.038
Top resid features:
teams
Token teams
Feature activation+0.086
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.815
Top resid features:
This
TokenThis
Feature activation+0.427
Top resid features:
organization
Token organization
Feature activation+0.384
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
Montana
Token Montana
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
planning
Token planning
Feature activation+0.038
Top resid features:
begins
Token begins
Feature activation+0.061
Top resid features:
for
Token for
Feature activation+0.041
Top resid features:
20
Token 20
Feature activation+0.036
Top resid features:
teams
Token teams
Feature activation+0.117
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.705
Top resid features:
This
TokenThis
Feature activation+0.284
Top resid features:
organization
Token organization
Feature activation+0.184
Top resid features:
in
Token in
Feature activation+0.295
Top resid features:
Montana
Token Montana
Feature activation+0.458
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
planning
Token planning
Feature activation+0.032
Top resid features:
begins
Token begins
Feature activation+0.051
Top resid features:
for
Token for
Feature activation+0.034
Top resid features:
20
Token 20
Feature activation+0.034
Top resid features:
teams
Token teams
Feature activation+0.087
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.467
Top resid features:
This
TokenThis
Feature activation+0.188
Top resid features:
organization
Token organization
Feature activation+0.095
Top resid features:
in
Token in
Feature activation+0.169
Top resid features:
Montana
Token Montana
Feature activation+0.156
Top resid features:
is
Token is
Feature activation+0.165
Top resid features:
planning
Token planning
Feature activation+0.021
Top resid features:
begins
Token begins
Feature activation+0.037
Top resid features:
for
Token for
Feature activation+0.024
Top resid features:
20
Token 20
Feature activation+0.029
Top resid features:
teams
Token teams
Feature activation+0.109
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.410
Top resid features:
This
TokenThis
Feature activation+0.108
Top resid features:
organization
Token organization
Feature activation+0.099
Top resid features:
in
Token in
Feature activation+0.093
Top resid features:
Montana
Token Montana
Feature activation+0.120
Top resid features:
is
Token is
Feature activation+0.089
Top resid features:
planning
Token planning
Feature activation+0.027
Top resid features:
begins
Token begins
Feature activation+0.046
Top resid features:
for
Token for
Feature activation+0.029
Top resid features:
20
Token 20
Feature activation+0.036
Top resid features:
teams
Token teams
Feature activation+0.098
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.414
Top resid features:
This
TokenThis
Feature activation+0.171
Top resid features:
organization
Token organization
Feature activation+0.122
Top resid features:
in
Token in
Feature activation+0.156
Top resid features:
Montana
Token Montana
Feature activation+0.143
Top resid features:
is
Token is
Feature activation+0.113
Top resid features:
planning
Token planning
Feature activation+0.034
Top resid features:
begins
Token begins
Feature activation+0.055
Top resid features:
for
Token for
Feature activation+0.043
Top resid features:
20
Token 20
Feature activation+0.038
Top resid features:
teams
Token teams
Feature activation+0.077
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.463
Top resid features:
This
TokenThis
Feature activation+0.237
Top resid features:
organization
Token organization
Feature activation+0.130
Top resid features:
in
Token in
Feature activation+0.193
Top resid features:
Montana
Token Montana
Feature activation+0.166
Top resid features:
is
Token is
Feature activation+0.224
Top resid features:
an
Token an
Feature activation+0.026
Top resid features:
upset
Token upset
Feature activation+0.048
Top resid features:
on
Token on
Feature activation+0.032
Top resid features:
their
Token their
Feature activation+0.036
Top resid features:
minds
Token minds
Feature activation+0.056
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.632
Top resid features:
When
TokenWhen
Feature activation+0.318
Top resid features:
you
Token you
Feature activation+0.191
Top resid features:
take
Token take
Feature activation+0.238
Top resid features:
into
Token into
Feature activation+0.189
Top resid features:
account
Token account
Feature activation+0.379
Top resid features:
planning
Token planning
Feature activation+0.047
Top resid features:
begins
Token begins
Feature activation+0.070
Top resid features:
for
Token for
Feature activation+0.054
Top resid features:
20
Token 20
Feature activation+0.045
Top resid features:
teams
Token teams
Feature activation+0.073
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.127
Top resid features:
This
TokenThis
Feature activation+0.572
Top resid features:
organization
Token organization
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
Montana
Token Montana
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
planning
Token planning
Feature activation+0.036
Top resid features:
begins
Token begins
Feature activation+0.059
Top resid features:
for
Token for
Feature activation+0.042
Top resid features:
20
Token 20
Feature activation+0.038
Top resid features:
teams
Token teams
Feature activation+0.075
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.555
Top resid features:
This
TokenThis
Feature activation+0.281
Top resid features:
organization
Token organization
Feature activation+0.153
Top resid features:
in
Token in
Feature activation+0.257
Top resid features:
Montana
Token Montana
Feature activation+0.194
Top resid features:
is
Token is
Feature activation+0.422
Top resid features:
of
Token of
Feature activation+0.098
Top resid features:
under
Token under
Feature activation+0.125
Top resid features:
priv
Tokenpriv
Feature activation+0.081
Top resid features:
ileged
Tokenileged
Feature activation+0.168
Top resid features:
students
Token students
Feature activation+0.198
Top resid features:
discreet
Token discreet
Feature activation+0.281
Top resid features:
ly
Tokenly
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
respectfully
Token respectfully
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.000
Top resid features:
true
Token true
Feature activation+0.038
Top resid features:
freshman
Token freshman
Feature activation+0.122
Top resid features:
Z
Token Z
Feature activation+0.041
Top resid features:
ander
Tokenander
Feature activation+0.038
Top resid features:
D
Token D
Feature activation+0.028
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.572
Top resid features:
L
TokenL
Feature activation+0.122
Top resid features:
ONDON
TokenONDON
Feature activation+0.229
Top resid features:
âĢĶ
Token âĢĶ
Feature activation+0.342
Top resid features:
Energy
Token Energy
Feature activation+0.241
Top resid features:
companies
Token companies
Feature activation+0.532
Top resid features:
an
Token an
Feature activation+0.036
Top resid features:
upset
Token upset
Feature activation+0.060
Top resid features:
on
Token on
Feature activation+0.040
Top resid features:
their
Token their
Feature activation+0.041
Top resid features:
minds
Token minds
Feature activation+0.078
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.086
Top resid features:
When
TokenWhen
Feature activation+0.614
Top resid features:
you
Token you
Feature activation+0.000
Top resid features:
take
Token take
Feature activation+0.000
Top resid features:
into
Token into
Feature activation+0.000
Top resid features:
account
Token account
Feature activation+0.000
Top resid features:
planning
Token planning
Feature activation+0.022
Top resid features:
begins
Token begins
Feature activation+0.038
Top resid features:
for
Token for
Feature activation+0.022
Top resid features:
20
Token 20
Feature activation+0.033
Top resid features:
teams
Token teams
Feature activation+0.088
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.384
Top resid features:
This
TokenThis
Feature activation+0.113
Top resid features:
organization
Token organization
Feature activation+0.089
Top resid features:
in
Token in
Feature activation+0.102
Top resid features:
Montana
Token Montana
Feature activation+0.124
Top resid features:
is
Token is
Feature activation+0.085
Top resid features:
L
TokenL
Feature activation+0.104
Top resid features:
ONDON
TokenONDON
Feature activation+0.225
Top resid features:
âĢĶ
Token âĢĶ
Feature activation+0.291
Top resid features:
Energy
Token Energy
Feature activation+0.202
Top resid features:
companies
Token companies
Feature activation+0.299
Top resid features:
have
Token have
Feature activation+0.539
Top resid features:
spent
Token spent
Feature activation+0.000
Top resid features:
months
Token months
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
state
Token state
Feature activation+0.000
Top resid features:
planning
Token planning
Feature activation+0.042
Top resid features:
begins
Token begins
Feature activation+0.073
Top resid features:
for
Token for
Feature activation+0.059
Top resid features:
20
Token 20
Feature activation+0.045
Top resid features:
teams
Token teams
Feature activation+0.082
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.647
Top resid features:
This
TokenThis
Feature activation+0.355
Top resid features:
organization
Token organization
Feature activation+0.170
Top resid features:
in
Token in
Feature activation+0.471
Top resid features:
Montana
Token Montana
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
an
Token an
Feature activation+0.036
Top resid features:
upset
Token upset
Feature activation+0.069
Top resid features:
on
Token on
Feature activation+0.045
Top resid features:
their
Token their
Feature activation+0.053
Top resid features:
minds
Token minds
Feature activation+0.075
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.851
Top resid features:
When
TokenWhen
Feature activation+0.466
Top resid features:
you
Token you
Feature activation+0.444
Top resid features:
take
Token take
Feature activation+0.000
Top resid features:
into
Token into
Feature activation+0.000
Top resid features:
account
Token account
Feature activation+0.000
Top resid features:
needs
Token needs
Feature activation+0.113
Top resid features:
of
Token of
Feature activation+0.110
Top resid features:
under
Token under
Feature activation+0.143
Top resid features:
priv
Tokenpriv
Feature activation+0.064
Top resid features:
ileged
Tokenileged
Feature activation+0.155
Top resid features:
students
Token students
Feature activation+0.385
Top resid features:
discreet
Token discreet
Feature activation+0.000
Top resid features:
ly
Tokenly
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
respectfully
Token respectfully
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.161
Top resid features:
Montana
Token Montana
Feature activation+0.153
Top resid features:
is
Token is
Feature activation+0.151
Top resid features:
helping
Token helping
Feature activation+0.240
Top resid features:
meet
Token meet
Feature activation+0.145
Top resid features:
the
Token the
Feature activation+0.406
Top resid features:
needs
Token needs
Feature activation+0.000
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
under
Token under
Feature activation+0.000
Top resid features:
priv
Tokenpriv
Feature activation+0.000
Top resid features:
ileged
Tokenileged
Feature activation+0.000
Top resid features:
months
Token months
Feature activation+0.150
Top resid features:
in
Token in
Feature activation+0.126
Top resid features:
a
Token a
Feature activation+0.119
Top resid features:
state
Token state
Feature activation+0.173
Top resid features:
of
Token of
Feature activation+0.138
Top resid features:
strategic
Token strategic
Feature activation+0.393
Top resid features:
paralysis
Token paralysis
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
wary
Token wary
Feature activation+0.000
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
making
Token making
Feature activation+0.000
Top resid features:
have
Token have
Feature activation+0.131
Top resid features:
spent
Token spent
Feature activation+0.138
Top resid features:
months
Token months
Feature activation+0.178
Top resid features:
in
Token in
Feature activation+0.180
Top resid features:
a
Token a
Feature activation+0.170
Top resid features:
state
Token state
Feature activation+0.442
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
strategic
Token strategic
Feature activation+0.000
Top resid features:
paralysis
Token paralysis
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
wary
Token wary
Feature activation+0.000
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.12

Head 2: 0.07

Head 3: 0.03

Head 4: 0.07

Head 5: 0.06

Head 6: 0.04

Head 7: 0.11

Head 8: 0.19

Head 9: 0.09

Head 10: 0.06

Head 11: 0.11

Positive logits

Feminist1.31

versible1.21

ymm1.18

abel1.16

Femin1.13

imb1.13

ormon1.09

tube1.08

feminist1.08

1.08

Neph1.07

hod1.06

kat1.05

colonial1.04

Britain1.03

abet1.02

flower1.01

ARDIS1.01

femin1.01

minist1.00

Negative logits

scoreboard-1.46

coaches-1.45

lockout-1.41

Coliseum-1.40

kickoff-1.39

playoffs-1.35

handshake-1.32

playoff-1.28

touchdowns-1.25

basketball-1.25

league-1.22

Reese-1.21

stadiums-1.20

scorer-1.20

Oilers-1.19

scrimmage-1.18

championships-1.18

postseason-1.18

rebounds-1.16

leagues-1.15

INTERVAL 1.249 - 1.387
CONTAINS 0.000%

for
Token for
Feature activation+0.000
20
Token 20
Feature activation+0.000
teams
Token teams
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
teams
Token teams
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984

INTERVAL 1.110 - 1.249
CONTAINS 0.000%

Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546
respectfully
Token respectfully
Feature activation+0.742

INTERVAL 0.971 - 1.110
CONTAINS 0.001%

of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546
respectfully
Token respectfully
Feature activation+0.742
.
Token.
Feature activation+0.535
Ċ
TokenĊ
Feature activation+0.394
<|endoftext|>
Token<|endoftext|>
Feature activation+0.203
L
TokenL
Feature activation+0.705
ONDON
TokenONDON
Feature activation+0.781
âĢĶ
Token âĢĶ
Feature activation+0.884
Energy
Token Energy
Feature activation+0.821
companies
Token companies
Feature activation+1.033
have
Token have
Feature activation+0.956
spent
Token spent
Feature activation+0.875
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546

INTERVAL 0.832 - 0.971
CONTAINS 0.001%

have
Token have
Feature activation+0.956
spent
Token spent
Feature activation+0.875
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
state
Token state
Feature activation+0.885
of
Token of
Feature activation+0.489
strategic
Token strategic
Feature activation+0.898
paralysis
Token paralysis
Feature activation+0.744
,
Token,
Feature activation+0.496
wary
Token wary
Feature activation+0.512
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
students
Token students
Feature activation+0.920
discreet
Token discreet
Feature activation+1.038
ly
Tokenly
Feature activation+0.542
and
Token and
Feature activation+0.546
respectfully
Token respectfully
Feature activation+0.742
.
Token.
Feature activation+0.535
ONDON
TokenONDON
Feature activation+0.781
âĢĶ
Token âĢĶ
Feature activation+0.884
Energy
Token Energy
Feature activation+0.821
companies
Token companies
Feature activation+1.033
have
Token have
Feature activation+0.956
spent
Token spent
Feature activation+0.875
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
state
Token state
Feature activation+0.885
of
Token of
Feature activation+0.489
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902
needs
Token needs
Feature activation+1.196
of
Token of
Feature activation+0.765
under
Token under
Feature activation+0.870
priv
Tokenpriv
Feature activation+0.984
ileged
Tokenileged
Feature activation+1.241
20
Token 20
Feature activation+0.000
teams
Token teams
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.445
This
TokenThis
Feature activation+1.061
organization
Token organization
Feature activation+1.387
in
Token in
Feature activation+0.933
Montana
Token Montana
Feature activation+1.360
is
Token is
Feature activation+1.051
helping
Token helping
Feature activation+1.087
meet
Token meet
Feature activation+1.294
the
Token the
Feature activation+0.902

INTERVAL 0.694 - 0.832
CONTAINS 0.003%

That
TokenThat
Feature activation+0.673
is
Token is
Feature activation+0.673
not
Token not
Feature activation+0.536
the
Token the
Feature activation+0.488
way
Token way
Feature activation+0.592
Microsoft
Token Microsoft
Feature activation+0.750
views
Token views
Feature activation+0.576
things
Token things
Feature activation+0.556
.
Token.
Feature activation+0.470
Microsoft
Token Microsoft
Feature activation+0.532
always
Token always
Feature activation+0.395
league
Tokenleague
Feature activation+0.000
).
Token).
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
The
TokenThe
Feature activation+0.546
warnings
Token warnings
Feature activation+0.585
have
Token have
Feature activation+0.758
been
Token been
Feature activation+0.695
quick
Token quick
Feature activation+0.784
and
Token and
Feature activation+0.489
delivered
Token delivered
Feature activation+0.566
with
Token with
Feature activation+0.427
<|endoftext|>
Token<|endoftext|>
Feature activation+0.139
SE
TokenSE
Feature activation+0.518
V
TokenV
Feature activation+0.637
ENT
TokenENT
Feature activation+0.427
Y
TokenY
Feature activation+0.719
years
Token years
Feature activation+0.732
have
Token have
Feature activation+0.811
passed
Token passed
Feature activation+0.802
but
Token but
Feature activation+0.566
the
Token the
Feature activation+0.514
emotion
Token emotion
Feature activation+0.705
companies
Token companies
Feature activation+1.033
have
Token have
Feature activation+0.956
spent
Token spent
Feature activation+0.875
months
Token months
Feature activation+0.802
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
state
Token state
Feature activation+0.885
of
Token of
Feature activation+0.489
strategic
Token strategic
Feature activation+0.898
paralysis
Token paralysis
Feature activation+0.744
,
Token,
Feature activation+0.496
in
Token in
Feature activation+0.695
a
Token a
Feature activation+0.722
state
Token state
Feature activation+0.885
of
Token of
Feature activation+0.489
strategic
Token strategic
Feature activation+0.898
paralysis
Token paralysis
Feature activation+0.744
,
Token,
Feature activation+0.496
wary
Token wary
Feature activation+0.512
of
Token of
Feature activation+0.355
making
Token making
Feature activation+0.525
big
Token big
Feature activation+0.671

INTERVAL 0.555 - 0.694
CONTAINS 0.006%

not
Token not
Feature activation+0.536
the
Token the
Feature activation+0.488
way
Token way
Feature activation+0.592
Microsoft
Token Microsoft
Feature activation+0.750
views
Token views
Feature activation+0.576
things
Token things
Feature activation+0.556
.
Token.
Feature activation+0.470
Microsoft
Token Microsoft
Feature activation+0.532
always
Token always
Feature activation+0.395
assumes
Token assumes
Feature activation+0.340
they
Token they
Feature activation+0.356
Ange
Token Ange
Feature activation+0.710
rer
Tokenrer
Feature activation+0.469
/
Token/
Feature activation+0.441
Getty
TokenGetty
Feature activation+0.379
Images
Token Images
Feature activation+0.419
Senate
Token Senate
Feature activation+0.577
Majority
Token Majority
Feature activation+0.484
Leader
Token Leader
Feature activation+0.212
Mitch
Token Mitch
Feature activation+0.404
McConnell
Token McConnell
Feature activation+0.406
refuses
Token refuses
Feature activation+0.415
,
Token,
Feature activation+0.350
2017
Token 2017
Feature activation+0.673
at
Token at
Feature activation+0.550
11
Token 11
Feature activation+0.559
:
Token:
Feature activation+0.381
15
Token15
Feature activation+0.612
am
Token am
Feature activation+0.647
:
Token:
Feature activation+0.411
The
Token The
Feature activation+0.446
full
Token full
Feature activation+0.620
deck
Token deck
Feature activation+0.639
team
Tokenteam
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.075
P
TokenP
Feature activation+0.566
ent
Tokenent
Feature activation+0.425
agon
Tokenagon
Feature activation+0.730
officials
Token officials
Feature activation+0.606
have
Token have
Feature activation+0.392
repeatedly
Token repeatedly
Feature activation+0.441
made
Token made
Feature activation+0.409
clear
Token clear
Feature activation+0.525
in
Token in
Feature activation+0.285
ENT
TokenENT
Feature activation+0.427
Y
TokenY
Feature activation+0.719
years
Token years
Feature activation+0.732
have
Token have
Feature activation+0.811
passed
Token passed
Feature activation+0.802
but
Token but
Feature activation+0.566
the
Token the
Feature activation+0.514
emotion
Token emotion
Feature activation+0.705
remains
Token remains
Feature activation+0.619
for
Token for
Feature activation+0.390
Australian
Token Australian
Feature activation+0.642

INTERVAL 0.416 - 0.555
CONTAINS 0.011%

--
Token--
Feature activation+0.524
ton
Tokenton
Feature activation+0.651
er
Tokener
Feature activation+0.533
or
Token or
Feature activation+0.413
ink
Token ink
Feature activation+0.387
,
Token,
Feature activation+0.497
paper
Token paper
Feature activation+0.281
,
Token,
Feature activation+0.392
imaging
Token imaging
Feature activation+0.409
drums
Token drums
Feature activation+0.499
,
Token,
Feature activation+0.406
of
Token of
Feature activation+0.260
people
Token people
Feature activation+0.474
of
Token of
Feature activation+0.246
various
Token various
Feature activation+0.447
backgrounds
Token backgrounds
Feature activation+0.582
who
Token who
Feature activation+0.425
believe
Token believe
Feature activation+0.592
you
Token you
Feature activation+0.385
shouldn
Token shouldn
Feature activation+0.285
't
Token't
Feature activation+0.171
talk
Token talk
Feature activation+0.429
agon
Tokenagon
Feature activation+0.730
officials
Token officials
Feature activation+0.606
have
Token have
Feature activation+0.392
repeatedly
Token repeatedly
Feature activation+0.441
made
Token made
Feature activation+0.409
clear
Token clear
Feature activation+0.525
in
Token in
Feature activation+0.285
recent
Token recent
Feature activation+0.459
weeks
Token weeks
Feature activation+0.459
that
Token that
Feature activation+0.478
they
Token they
Feature activation+0.406
on
Token on
Feature activation+0.280
a
Token a
Feature activation+0.326
contract
Token contract
Feature activation+0.593
basis
Token basis
Feature activation+0.556
are
Token are
Feature activation+0.367
now
Token now
Feature activation+0.436
leading
Token leading
Feature activation+0.309
the
Token the
Feature activation+0.190
way
Token way
Feature activation+0.192
in
Token in
Feature activation+0.094
the
Token the
Feature activation+0.146
made
Token made
Feature activation+0.409
clear
Token clear
Feature activation+0.525
in
Token in
Feature activation+0.285
recent
Token recent
Feature activation+0.459
weeks
Token weeks
Feature activation+0.459
that
Token that
Feature activation+0.478
they
Token they
Feature activation+0.406
are
Token are
Feature activation+0.383
pushing
Token pushing
Feature activation+0.336
the
Token the
Feature activation+0.243
Obama
Token Obama
Feature activation+0.291

INTERVAL 0.277 - 0.416
CONTAINS 0.020%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
This
TokenThis
Feature activation+0.265
article
Token article
Feature activation+0.340
is
Token is
Feature activation+0.228
over
Token over
Feature activation+0.216
3
Token 3
Feature activation+0.388
years
Token years
Feature activation+0.410
old
Token old
Feature activation+0.513
Ċ
TokenĊ
Feature activation+0.097
Ċ
TokenĊ
Feature activation+0.024
Police
TokenPolice
Feature activation+0.155
I
Token I
Feature activation+0.000
get
Token get
Feature activation+0.000
a
Token a
Feature activation+0.000
very
Token very
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
There
TokenThere
Feature activation+0.365
are
Token are
Feature activation+0.409
a
Token a
Feature activation+0.281
lot
Token lot
Feature activation+0.521
of
Token of
Feature activation+0.260
people
Token people
Feature activation+0.474
counties
Token counties
Feature activation+0.036
in
Token in
Feature activation+0.084
western
Token western
Feature activation+0.217
Wisconsin
Token Wisconsin
Feature activation+0.121
have
Token have
Feature activation+0.224
become
Token become
Feature activation+0.292
difficult
Token difficult
Feature activation+0.334
to
Token to
Feature activation+0.107
predict
Token predict
Feature activation+0.196
politically
Token politically
Feature activation+0.271
.
Token.
Feature activation+0.115
Z
Token Z
Feature activation+0.257
ell
Tokenell
Feature activation+0.149
ner
Tokenner
Feature activation+0.203
is
Token is
Feature activation+0.183
a
Token a
Feature activation+0.160
PhD
Token PhD
Feature activation+0.296
student
Token student
Feature activation+0.135
in
Token in
Feature activation+0.004
the
Token the
Feature activation+0.000
Ed
Token Ed
Feature activation+0.030
Psych
Token Psych
Feature activation+0.006
<|endoftext|>
Token<|endoftext|>
Feature activation+0.083
During
TokenDuring
Feature activation+0.775
the
Token the
Feature activation+0.592
2016
Token 2016
Feature activation+0.696
presidential
Token presidential
Feature activation+0.484
campaign
Token campaign
Feature activation+0.406
,
Token,
Feature activation+0.274
Donald
Token Donald
Feature activation+0.402
Trump
Token Trump
Feature activation+0.262
presented
Token presented
Feature activation+0.338
a
Token a
Feature activation+0.196

INTERVAL 0.139 - 0.277
CONTAINS 0.022%

its
Token its
Feature activation+0.332
predecessor
Token predecessor
Feature activation+0.627
emerged
Token emerged
Feature activation+0.275
about
Token about
Feature activation+0.041
a
Token a
Feature activation+0.113
month
Token month
Feature activation+0.219
ago
Token ago
Feature activation+0.074
,
Token,
Feature activation+0.000
Apple
Token Apple
Feature activation+0.048
today
Token today
Feature activation+0.053
made
Token made
Feature activation+0.070
you
Tokenyou
Feature activation+0.583
may
Token may
Feature activation+0.317
find
Token find
Feature activation+0.308
that
Token that
Feature activation+0.280
a
Token a
Feature activation+0.311
printer
Token printer
Feature activation+0.267
is
Token is
Feature activation+0.260
one
Token one
Feature activation+0.243
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.135
most
Token most
Feature activation+0.087
is
Token is
Feature activation+0.260
one
Token one
Feature activation+0.243
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.135
most
Token most
Feature activation+0.087
expensive
Token expensive
Feature activation+0.149
pieces
Token pieces
Feature activation+0.143
of
Token of
Feature activation+0.000
IT
Token IT
Feature activation+0.002
equipment
Token equipment
Feature activation+0.015
in
Token in
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Long
TokenLong
Feature activation+0.142
before
Token before
Feature activation+0.014
five
Token five
Feature activation+0.233
different
Token different
Feature activation+0.227
goal
Token goal
Feature activation+0.542
score
Token score
Feature activation+0.347
rs
Tokenrs
Feature activation+0.349
prompted
Token prompted
Feature activation+0.155
the
Token the
Feature activation+0.000
,
Token,
Feature activation+0.183
one
Token one
Feature activation+0.331
of
Token of
Feature activation+0.094
which
Token which
Feature activation+0.275
being
Token being
Feature activation+0.360
the
Token the
Feature activation+0.255
windows
Token windows
Feature activation+0.333
not
Token not
Feature activation+0.187
rolling
Token rolling
Feature activation+0.391
up
Token up
Feature activation+0.211
or
Token or
Feature activation+0.135

INTERVAL 0.000 - 0.139
CONTAINS 99.935%

.
Token.
Feature activation+0.000
Those
Token Those
Feature activation+0.000
changes
Token changes
Feature activation+0.000
in
Token in
Feature activation+0.000
sens
Token sens
Feature activation+0.000
ibility
Tokenibility
Feature activation+0.000
and
Token and
Feature activation+0.000
consciousness
Token consciousness
Feature activation+0.000
never
Token never
Feature activation+0.000
correspond
Token correspond
Feature activation+0.000
exactly
Token exactly
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Then
TokenThen
Feature activation+0.000
you
Token you
Feature activation+0.000
are
Token are
Feature activation+0.000
probably
Token probably
Feature activation+0.000
running
Token running
Feature activation+0.000
with
Token with
Feature activation+0.000
the
Token the
Feature activation+0.000
vulnerable
Token vulnerable
Feature activation+0.000
feature
Token feature
Feature activation+0.000
enabled
Token enabled
Feature activation+0.000
list
Token list
Feature activation+0.000
of
Token of
Feature activation+0.000
best
Token best
Feature activation+0.000
smartphones
Token smartphones
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Best
TokenBest
Feature activation+0.000
Wear
Token Wear
Feature activation+0.000
able
Tokenable
Feature activation+0.000
Tech
Token Tech
Feature activation+0.000
Trek
Token Trek
Feature activation+0.000
love
Token love
Feature activation+0.000
was
Token was
Feature activation+0.000
the
Token the
Feature activation+0.000
Original
Token Original
Feature activation+0.000
Series
Token Series
Feature activation+0.000
.
Token.
Feature activation+0.000
Perfect
Token Perfect
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Copyright
Token Copyright
Feature activation+0.000
2014
Token 2014
Feature activation+0.000
CBS
Token CBS
Feature activation+0.000
Broadcasting
Token Broadcasting
Feature activation+0.000
Inc
Token Inc
Feature activation+0.000
.
Token.
Feature activation+0.000
Used
Token Used
Feature activation+0.000
under
Token under
Feature activation+0.000
license
Token license
Feature activation+0.000
.
Token.
Feature activation+0.000
All
Token All
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 2 in H1.8: (feature 8083

TOP ACTIVATIONS
MAX = 1.463

isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792
Modi
Token Modi
Feature activation+0.017
later
Token later
Feature activation+0.000
in
Token in
Feature activation+0.000
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
later
Token later
Feature activation+0.000
in
Token in
Feature activation+0.000
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
in
Token in
Feature activation+0.000
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792
L
Token L
Feature activation+0.497
ening
Tokenening
Feature activation+0.612
rad
Tokenrad
Feature activation+0.746
,
Token,
Feature activation+0.572
Editing
Token Editing
Feature activation+0.000
by
Token by
Feature activation+0.000
Matthew
Token Matthew
Feature activation+0.000
Jones
Token Jones
Feature activation+0.000
)
Token)
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.046
WASHINGTON
TokenWASHINGTON
Feature activation+0.730
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
the
Token the
Feature activation+0.766
troubled
Token troubled
Feature activation+0.894
rollout
Token rollout
Feature activation+0.988
of
Token of
Feature activation+0.599
President
Token President
Feature activation+0.699
Barack
Token Barack
Feature activation+0.714
Obama
Token Obama
Feature activation+0.647
âĢ
TokenâĢ
Feature activation+0.727
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
a
Token a
Feature activation+0.792
new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
the
Token the
Feature activation+0.766
troubled
Token troubled
Feature activation+0.894
rollout
Token rollout
Feature activation+0.988
of
Token of
Feature activation+0.599
President
Token President
Feature activation+0.699
Barack
Token Barack
Feature activation+0.714
Obama
Token Obama
Feature activation+0.647
Matthew
Token Matthew
Feature activation+0.000
Jones
Token Jones
Feature activation+0.000
)
Token)
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.046
WASHINGTON
TokenWASHINGTON
Feature activation+0.730
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792
L
Token L
Feature activation+0.497
ening
Tokenening
Feature activation+0.612
rad
Tokenrad
Feature activation+0.746
<|endoftext|>
Token<|endoftext|>
Feature activation+1.046
WASHINGTON
TokenWASHINGTON
Feature activation+0.730
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
announced
Token announced
Feature activation+0.785
a
Token a
Feature activation+0.792
new
Token new
Feature activation+0.844
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
announced
Token announced
Feature activation+0.785
a
Token a
Feature activation+0.792
new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
the
Token the
Feature activation+0.766
troubled
Token troubled
Feature activation+0.894
rollout
Token rollout
Feature activation+0.988
Jones
Token Jones
Feature activation+0.000
)
Token)
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.046
WASHINGTON
TokenWASHINGTON
Feature activation+0.730
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
announced
Token announced
Feature activation+0.785
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
announced
Token announced
Feature activation+0.785
a
Token a
Feature activation+0.792
new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
the
Token the
Feature activation+0.766
troubled
Token troubled
Feature activation+0.894
rollout
Token rollout
Feature activation+0.988
of
Token of
Feature activation+0.599
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
announced
Token announced
Feature activation+0.785
a
Token a
Feature activation+0.792
new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792
L
Token L
Feature activation+0.497

Top DFA by src position
MAX = 1.312

Modi
Token Modi
Feature activation+0.336
Top resid features:
later
Token later
Feature activation+0.062
Top resid features:
in
Token in
Feature activation+0.043
Top resid features:
October
Token October
Feature activation+0.053
Top resid features:
.
Token.
Feature activation+0.124
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.581
Top resid features:
Al
TokenAl
Feature activation+0.152
Top resid features:
isa
Tokenisa
Feature activation+0.168
Top resid features:
Vox
Token Vox
Feature activation+0.278
Top resid features:
,
Token,
Feature activation+0.297
Top resid features:
formerly
Token formerly
Feature activation+0.209
Top resid features:
Modi
Token Modi
Feature activation+0.514
Top resid features:
later
Token later
Feature activation+0.095
Top resid features:
in
Token in
Feature activation+0.065
Top resid features:
October
Token October
Feature activation+0.130
Top resid features:
.
Token.
Feature activation+0.225
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.144
Top resid features:
Al
TokenAl
Feature activation+0.000
Top resid features:
isa
Tokenisa
Feature activation+0.000
Top resid features:
Vox
Token Vox
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
formerly
Token formerly
Feature activation+0.000
Top resid features:
Modi
Token Modi
Feature activation+0.303
Top resid features:
later
Token later
Feature activation+0.097
Top resid features:
in
Token in
Feature activation+0.066
Top resid features:
October
Token October
Feature activation+0.066
Top resid features:
.
Token.
Feature activation+0.170
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.312
Top resid features:
Al
TokenAl
Feature activation+0.228
Top resid features:
isa
Tokenisa
Feature activation+0.000
Top resid features:
Vox
Token Vox
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
formerly
Token formerly
Feature activation+0.000
Top resid features:
Modi
Token Modi
Feature activation+0.357
Top resid features:
later
Token later
Feature activation+0.082
Top resid features:
in
Token in
Feature activation+0.045
Top resid features:
October
Token October
Feature activation+0.056
Top resid features:
.
Token.
Feature activation+0.177
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.973
Top resid features:
Al
TokenAl
Feature activation+0.214
Top resid features:
isa
Tokenisa
Feature activation+0.337
Top resid features:
Vox
Token Vox
Feature activation+0.188
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
formerly
Token formerly
Feature activation+0.000
Top resid features:
Modi
Token Modi
Feature activation+0.287
Top resid features:
later
Token later
Feature activation+0.087
Top resid features:
in
Token in
Feature activation+0.056
Top resid features:
October
Token October
Feature activation+0.067
Top resid features:
.
Token.
Feature activation+0.178
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.001
Top resid features:
Al
TokenAl
Feature activation+0.253
Top resid features:
isa
Tokenisa
Feature activation+0.308
Top resid features:
Vox
Token Vox
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
formerly
Token formerly
Feature activation+0.000
Top resid features:
Modi
Token Modi
Feature activation+0.281
Top resid features:
later
Token later
Feature activation+0.089
Top resid features:
in
Token in
Feature activation+0.060
Top resid features:
October
Token October
Feature activation+0.063
Top resid features:
.
Token.
Feature activation+0.153
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.673
Top resid features:
Al
TokenAl
Feature activation+0.165
Top resid features:
isa
Tokenisa
Feature activation+0.249
Top resid features:
Vox
Token Vox
Feature activation+0.326
Top resid features:
,
Token,
Feature activation+0.397
Top resid features:
formerly
Token formerly
Feature activation+0.240
Top resid features:
Modi
Token Modi
Feature activation+0.333
Top resid features:
later
Token later
Feature activation+0.061
Top resid features:
in
Token in
Feature activation+0.040
Top resid features:
October
Token October
Feature activation+0.048
Top resid features:
.
Token.
Feature activation+0.122
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.463
Top resid features:
Al
TokenAl
Feature activation+0.102
Top resid features:
isa
Tokenisa
Feature activation+0.135
Top resid features:
Vox
Token Vox
Feature activation+0.174
Top resid features:
,
Token,
Feature activation+0.191
Top resid features:
formerly
Token formerly
Feature activation+0.154
Top resid features:
Editing
Token Editing
Feature activation+0.065
Top resid features:
by
Token by
Feature activation+0.074
Top resid features:
Matthew
Token Matthew
Feature activation+0.105
Top resid features:
Jones
Token Jones
Feature activation+0.092
Top resid features:
)
Token)
Feature activation+0.142
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.905
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.000
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.000
Top resid features:
Congress
TokenCongress
Feature activation+0.000
Top resid features:
ional
Tokenional
Feature activation+0.000
Top resid features:
Republicans
Token Republicans
Feature activation+0.000
Top resid features:
Editing
Token Editing
Feature activation+0.021
Top resid features:
by
Token by
Feature activation+0.032
Top resid features:
Matthew
Token Matthew
Feature activation+0.035
Top resid features:
Jones
Token Jones
Feature activation+0.037
Top resid features:
)
Token)
Feature activation+0.064
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.278
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.061
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.167
Top resid features:
Congress
TokenCongress
Feature activation+0.112
Top resid features:
ional
Tokenional
Feature activation+0.084
Top resid features:
Republicans
Token Republicans
Feature activation+0.118
Top resid features:
Modi
Token Modi
Feature activation+0.235
Top resid features:
later
Token later
Feature activation+0.086
Top resid features:
in
Token in
Feature activation+0.075
Top resid features:
October
Token October
Feature activation+0.075
Top resid features:
.
Token.
Feature activation+0.198
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.735
Top resid features:
Al
TokenAl
Feature activation+0.194
Top resid features:
isa
Tokenisa
Feature activation+0.304
Top resid features:
Vox
Token Vox
Feature activation+0.385
Top resid features:
,
Token,
Feature activation+0.430
Top resid features:
formerly
Token formerly
Feature activation+0.000
Top resid features:
Modi
Token Modi
Feature activation+0.189
Top resid features:
later
Token later
Feature activation+0.081
Top resid features:
in
Token in
Feature activation+0.068
Top resid features:
October
Token October
Feature activation+0.083
Top resid features:
.
Token.
Feature activation+0.158
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.568
Top resid features:
Al
TokenAl
Feature activation+0.171
Top resid features:
isa
Tokenisa
Feature activation+0.225
Top resid features:
Vox
Token Vox
Feature activation+0.298
Top resid features:
,
Token,
Feature activation+0.369
Top resid features:
formerly
Token formerly
Feature activation+0.228
Top resid features:
Editing
Token Editing
Feature activation+0.025
Top resid features:
by
Token by
Feature activation+0.033
Top resid features:
Matthew
Token Matthew
Feature activation+0.036
Top resid features:
Jones
Token Jones
Feature activation+0.037
Top resid features:
)
Token)
Feature activation+0.064
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.285
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.064
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.176
Top resid features:
Congress
TokenCongress
Feature activation+0.080
Top resid features:
ional
Tokenional
Feature activation+0.090
Top resid features:
Republicans
Token Republicans
Feature activation+0.129
Top resid features:
Editing
Token Editing
Feature activation+0.041
Top resid features:
by
Token by
Feature activation+0.063
Top resid features:
Matthew
Token Matthew
Feature activation+0.090
Top resid features:
Jones
Token Jones
Feature activation+0.080
Top resid features:
)
Token)
Feature activation+0.127
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.756
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.188
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.394
Top resid features:
Congress
TokenCongress
Feature activation+0.000
Top resid features:
ional
Tokenional
Feature activation+0.000
Top resid features:
Republicans
Token Republicans
Feature activation+0.000
Top resid features:
Modi
Token Modi
Feature activation+0.233
Top resid features:
later
Token later
Feature activation+0.055
Top resid features:
in
Token in
Feature activation+0.040
Top resid features:
October
Token October
Feature activation+0.052
Top resid features:
.
Token.
Feature activation+0.141
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.470
Top resid features:
Al
TokenAl
Feature activation+0.115
Top resid features:
isa
Tokenisa
Feature activation+0.172
Top resid features:
Vox
Token Vox
Feature activation+0.254
Top resid features:
,
Token,
Feature activation+0.228
Top resid features:
formerly
Token formerly
Feature activation+0.188
Top resid features:
Editing
Token Editing
Feature activation+0.041
Top resid features:
by
Token by
Feature activation+0.054
Top resid features:
Matthew
Token Matthew
Feature activation+0.069
Top resid features:
Jones
Token Jones
Feature activation+0.071
Top resid features:
)
Token)
Feature activation+0.104
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.600
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.134
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.284
Top resid features:
Congress
TokenCongress
Feature activation+0.065
Top resid features:
ional
Tokenional
Feature activation+0.183
Top resid features:
Republicans
Token Republicans
Feature activation+0.162
Top resid features:
Editing
Token Editing
Feature activation+0.025
Top resid features:
by
Token by
Feature activation+0.041
Top resid features:
Matthew
Token Matthew
Feature activation+0.049
Top resid features:
Jones
Token Jones
Feature activation+0.049
Top resid features:
)
Token)
Feature activation+0.082
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.373
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.079
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.207
Top resid features:
Congress
TokenCongress
Feature activation+0.086
Top resid features:
ional
Tokenional
Feature activation+0.117
Top resid features:
Republicans
Token Republicans
Feature activation+0.169
Top resid features:
Editing
Token Editing
Feature activation+0.044
Top resid features:
by
Token by
Feature activation+0.058
Top resid features:
Matthew
Token Matthew
Feature activation+0.079
Top resid features:
Jones
Token Jones
Feature activation+0.080
Top resid features:
)
Token)
Feature activation+0.114
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.800
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.159
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.381
Top resid features:
Congress
TokenCongress
Feature activation+0.060
Top resid features:
ional
Tokenional
Feature activation+0.000
Top resid features:
Republicans
Token Republicans
Feature activation+0.000
Top resid features:
Editing
Token Editing
Feature activation+0.030
Top resid features:
by
Token by
Feature activation+0.038
Top resid features:
Matthew
Token Matthew
Feature activation+0.045
Top resid features:
Jones
Token Jones
Feature activation+0.046
Top resid features:
)
Token)
Feature activation+0.071
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.361
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.084
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.169
Top resid features:
Congress
TokenCongress
Feature activation+0.099
Top resid features:
ional
Tokenional
Feature activation+0.095
Top resid features:
Republicans
Token Republicans
Feature activation+0.152
Top resid features:
Editing
Token Editing
Feature activation+0.039
Top resid features:
by
Token by
Feature activation+0.048
Top resid features:
Matthew
Token Matthew
Feature activation+0.059
Top resid features:
Jones
Token Jones
Feature activation+0.058
Top resid features:
)
Token)
Feature activation+0.108
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.496
Top resid features:
WASHINGTON
TokenWASHINGTON
Feature activation+0.104
Top resid features:
âĢĶ
TokenâĢĶ
Feature activation+0.284
Top resid features:
Congress
TokenCongress
Feature activation+0.086
Top resid features:
ional
Tokenional
Feature activation+0.151
Top resid features:
Republicans
Token Republicans
Feature activation+0.185
Top resid features:
Modi
Token Modi
Feature activation+0.211
Top resid features:
later
Token later
Feature activation+0.080
Top resid features:
in
Token in
Feature activation+0.055
Top resid features:
October
Token October
Feature activation+0.080
Top resid features:
.
Token.
Feature activation+0.152
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.506
Top resid features:
Al
TokenAl
Feature activation+0.140
Top resid features:
isa
Tokenisa
Feature activation+0.182
Top resid features:
Vox
Token Vox
Feature activation+0.243
Top resid features:
,
Token,
Feature activation+0.285
Top resid features:
formerly
Token formerly
Feature activation+0.198
Top resid features:

Decoder Weights Distribution

Head 0: 0.04

Head 1: 0.11

Head 2: 0.06

Head 3: 0.03

Head 4: 0.06

Head 5: 0.07

Head 6: 0.06

Head 7: 0.13

Head 8: 0.19

Head 9: 0.11

Head 10: 0.05

Head 11: 0.08

Positive logits

ULTS1.17

Bundesliga1.15

1.13

efeated1.12

lations1.09

Blizz1.08

Ange1.06

utical1.04

Odin1.02

Soccer1.02

StarCraft1.01

eah1.01

sibling1.00

esports0.99

0.99

Winged0.98

ISTORY0.97

FACE0.97

Origin0.97

descendants0.96

Negative logits

chenko-1.09

unsub-1.09

bott-1.08

selves-1.07

sergeant-1.05

brig-1.04

jee-1.03

croft-1.02

Gors-1.01

��-1.01

diss-1.01

uti-0.99

polic-0.98

osuke-0.98

gha-0.96

oub-0.96

misrepresent-0.96

Breach-0.95

Mahm-0.95

ebus-0.95

INTERVAL 1.317 - 1.463
CONTAINS 0.000%

Modi
Token Modi
Feature activation+0.017
later
Token later
Feature activation+0.000
in
Token in
Feature activation+0.000
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792

INTERVAL 1.171 - 1.317
CONTAINS 0.000%

later
Token later
Feature activation+0.000
in
Token in
Feature activation+0.000
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
in
Token in
Feature activation+0.000
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
October
Token October
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794

INTERVAL 1.024 - 1.171
CONTAINS 0.000%

Editing
Token Editing
Feature activation+0.000
by
Token by
Feature activation+0.000
Matthew
Token Matthew
Feature activation+0.000
Jones
Token Jones
Feature activation+0.000
)
Token)
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.046
WASHINGTON
TokenWASHINGTON
Feature activation+0.730
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792
L
Token L
Feature activation+0.497
ening
Tokenening
Feature activation+0.612
rad
Tokenrad
Feature activation+0.746
,
Token,
Feature activation+0.572

INTERVAL 0.878 - 1.024
CONTAINS 0.001%

new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
the
Token the
Feature activation+0.766
troubled
Token troubled
Feature activation+0.894
rollout
Token rollout
Feature activation+0.988
of
Token of
Feature activation+0.599
President
Token President
Feature activation+0.699
Barack
Token Barack
Feature activation+0.714
Obama
Token Obama
Feature activation+0.647
âĢ
TokenâĢ
Feature activation+0.727
a
Token a
Feature activation+0.792
new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
the
Token the
Feature activation+0.766
troubled
Token troubled
Feature activation+0.894
rollout
Token rollout
Feature activation+0.988
of
Token of
Feature activation+0.599
President
Token President
Feature activation+0.699
Barack
Token Barack
Feature activation+0.714
Obama
Token Obama
Feature activation+0.647
Matthew
Token Matthew
Feature activation+0.000
Jones
Token Jones
Feature activation+0.000
)
Token)
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.046
WASHINGTON
TokenWASHINGTON
Feature activation+0.730
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792
L
Token L
Feature activation+0.497
ening
Tokenening
Feature activation+0.612
rad
Tokenrad
Feature activation+0.746
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.398
Al
TokenAl
Feature activation+1.264
isa
Tokenisa
Feature activation+1.249
Vox
Token Vox
Feature activation+1.250
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770

INTERVAL 0.732 - 0.878
CONTAINS 0.002%

rad
Tokenrad
Feature activation+0.746
,
Token,
Feature activation+0.572
released
Token released
Feature activation+0.793
a
Token a
Feature activation+0.629
new
Token new
Feature activation+0.711
music
Token music
Feature activation+0.759
video
Token video
Feature activation+0.654
on
Token on
Feature activation+0.286
Monday
Token Monday
Feature activation+0.641
,
Token,
Feature activation+0.491
titled
Token titled
Feature activation+0.767
music
Token music
Feature activation+0.759
video
Token video
Feature activation+0.654
on
Token on
Feature activation+0.286
Monday
Token Monday
Feature activation+0.641
,
Token,
Feature activation+0.491
titled
Token titled
Feature activation+0.767
âĢ
Token âĢ
Feature activation+0.516
ľ
Tokenľ
Feature activation+0.280
Baby
TokenBaby
Feature activation+0.617
Boy
Token Boy
Feature activation+0.367
.
Token.
Feature activation+0.587
,
Token,
Feature activation+0.963
formerly
Token formerly
Feature activation+1.169
a
Token a
Feature activation+0.914
singer
Token singer
Feature activation+1.463
in
Token in
Feature activation+0.794
the
Token the
Feature activation+0.770
Russian
Token Russian
Feature activation+0.890
rock
Token rock
Feature activation+1.059
band
Token band
Feature activation+0.792
L
Token L
Feature activation+0.497
ening
Tokenening
Feature activation+0.612
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
announced
Token announced
Feature activation+0.785
a
Token a
Feature activation+0.792
new
Token new
Feature activation+0.844
investigation
Token investigation
Feature activation+0.827
into
Token into
Feature activation+0.559
the
Token the
Feature activation+0.766
Jones
Token Jones
Feature activation+0.000
)
Token)
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+1.046
WASHINGTON
TokenWASHINGTON
Feature activation+0.730
âĢĶ
TokenâĢĶ
Feature activation+0.891
Congress
TokenCongress
Feature activation+0.834
ional
Tokenional
Feature activation+0.574
Republicans
Token Republicans
Feature activation+0.854
on
Token on
Feature activation+0.610
Tuesday
Token Tuesday
Feature activation+0.794
announced
Token announced
Feature activation+0.785

INTERVAL 0.585 - 0.732
CONTAINS 0.003%

rollout
Token rollout
Feature activation+0.988
of
Token of
Feature activation+0.599
President
Token President
Feature activation+0.699
Barack
Token Barack
Feature activation+0.714
Obama
Token Obama
Feature activation+0.647
âĢ
TokenâĢ
Feature activation+0.727
Ļ
TokenĻ
Feature activation+0.342
s
Tokens
Feature activation+0.505
health
Token health
Feature activation+0.601
care
Token care
Feature activation+0.473
reforms
Token reforms
Feature activation+0.577
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.362
Along
TokenAlong
Feature activation+0.391
with
Token with
Feature activation+0.558
the
Token the
Feature activation+0.587
overall
Token overall
Feature activation+0.621
chances
Token chances
Feature activation+0.629
of
Token of
Feature activation+0.387
winning
Token winning
Feature activation+0.526
the
Token the
Feature activation+0.481
election
Token election
Feature activation+0.569
players
Token players
Feature activation+0.000
like
Token like
Feature activation+0.000
3
Token 3
Feature activation+0.000
M
TokenM
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.671
At
TokenAt
Feature activation+0.577
least
Token least
Feature activation+0.654
12
Token 12
Feature activation+0.552
people
Token people
Feature activation+0.492
,
Token,
Feature activation+0.403
Barack
Token Barack
Feature activation+0.714
Obama
Token Obama
Feature activation+0.647
âĢ
TokenâĢ
Feature activation+0.727
Ļ
TokenĻ
Feature activation+0.342
s
Tokens
Feature activation+0.505
health
Token health
Feature activation+0.601
care
Token care
Feature activation+0.473
reforms
Token reforms
Feature activation+0.577
,
Token,
Feature activation+0.455
aimed
Token aimed
Feature activation+0.570
at
Token at
Feature activation+0.438
<|endoftext|>
Token<|endoftext|>
Feature activation+0.362
Along
TokenAlong
Feature activation+0.391
with
Token with
Feature activation+0.558
the
Token the
Feature activation+0.587
overall
Token overall
Feature activation+0.621
chances
Token chances
Feature activation+0.629
of
Token of
Feature activation+0.387
winning
Token winning
Feature activation+0.526
the
Token the
Feature activation+0.481
election
Token election
Feature activation+0.569
,
Token,
Feature activation+0.511

INTERVAL 0.439 - 0.585
CONTAINS 0.005%

of
Token of
Feature activation+0.000
blood
Token blood
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.362
In
TokenIn
Feature activation+0.363
a
Token a
Feature activation+0.449
medical
Token medical
Feature activation+0.622
first
Token first
Feature activation+0.578
,
Token,
Feature activation+0.502
doctors
Token doctors
Feature activation+0.573
at
Token at
Feature activation+0.372
with
Token with
Feature activation+0.558
the
Token the
Feature activation+0.587
overall
Token overall
Feature activation+0.621
chances
Token chances
Feature activation+0.629
of
Token of
Feature activation+0.387
winning
Token winning
Feature activation+0.526
the
Token the
Feature activation+0.481
election
Token election
Feature activation+0.569
,
Token,
Feature activation+0.511
Nate
Token Nate
Feature activation+0.395
Silver
Token Silver
Feature activation+0.424
Great
Token Great
Feature activation+0.428
Or
Token Or
Feature activation+0.577
mond
Tokenmond
Feature activation+0.640
Street
Token Street
Feature activation+0.521
hospital
Token hospital
Feature activation+0.354
believe
Token believe
Feature activation+0.550
they
Token they
Feature activation+0.484
cured
Token cured
Feature activation+0.289
two
Token two
Feature activation+0.283
babies
Token babies
Feature activation+0.353
of
Token of
Feature activation+0.204
overall
Token overall
Feature activation+0.621
chances
Token chances
Feature activation+0.629
of
Token of
Feature activation+0.387
winning
Token winning
Feature activation+0.526
the
Token the
Feature activation+0.481
election
Token election
Feature activation+0.569
,
Token,
Feature activation+0.511
Nate
Token Nate
Feature activation+0.395
Silver
Token Silver
Feature activation+0.424
's
Token's
Feature activation+0.373
Five
Token Five
Feature activation+0.419
President
Token President
Feature activation+0.699
Barack
Token Barack
Feature activation+0.714
Obama
Token Obama
Feature activation+0.647
âĢ
TokenâĢ
Feature activation+0.727
Ļ
TokenĻ
Feature activation+0.342
s
Tokens
Feature activation+0.505
health
Token health
Feature activation+0.601
care
Token care
Feature activation+0.473
reforms
Token reforms
Feature activation+0.577
,
Token,
Feature activation+0.455
aimed
Token aimed
Feature activation+0.570

INTERVAL 0.293 - 0.439
CONTAINS 0.010%

man
Token man
Feature activation+0.475
for
Token for
Feature activation+0.119
participating
Token participating
Feature activation+0.160
in
Token in
Feature activation+0.000
political
Token political
Feature activation+0.055
protests
Token protests
Feature activation+0.295
,
Token,
Feature activation+0.096
describing
Token describing
Feature activation+0.152
him
Token him
Feature activation+0.193
as
Token as
Feature activation+0.000
a
Token a
Feature activation+0.085
trial
Token trial
Feature activation+0.000
against
Token against
Feature activation+0.000
U
Token U
Feature activation+0.000
CC
TokenCC
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.362
Along
TokenAlong
Feature activation+0.391
with
Token with
Feature activation+0.558
the
Token the
Feature activation+0.587
overall
Token overall
Feature activation+0.621
chances
Token chances
Feature activation+0.629
Along
TokenAlong
Feature activation+0.391
with
Token with
Feature activation+0.558
the
Token the
Feature activation+0.587
overall
Token overall
Feature activation+0.621
chances
Token chances
Feature activation+0.629
of
Token of
Feature activation+0.387
winning
Token winning
Feature activation+0.526
the
Token the
Feature activation+0.481
election
Token election
Feature activation+0.569
,
Token,
Feature activation+0.511
Nate
Token Nate
Feature activation+0.395
<|endoftext|>
Token<|endoftext|>
Feature activation+0.559
The
TokenThe
Feature activation+0.375
Sur
Token Sur
Feature activation+0.089
prising
Tokenprising
Feature activation+0.291
Story
Token Story
Feature activation+0.362
Of
Token Of
Feature activation+0.298
'
Token '
Feature activation+0.206
Thomas
TokenThomas
Feature activation+0.107
Jefferson
Token Jefferson
Feature activation+0.242
's
Token's
Feature activation+0.035
Qur
Token Qur
Feature activation+0.143
most
Token most
Feature activation+0.036
instances
Token instances
Feature activation+0.000
.
Token.
Feature activation+0.186
OK
TokenOK
Feature activation+0.256
,
Token,
Feature activation+0.200
now
Token now
Feature activation+0.384
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.037
when
Token when
Feature activation+0.131
I
Token I
Feature activation+0.000

INTERVAL 0.146 - 0.293
CONTAINS 0.013%

115
Token 115
Feature activation+0.175
th
Tokenth
Feature activation+0.103
anniversary
Token anniversary
Feature activation+0.009
.
Token.
Feature activation+0.278
Ċ
TokenĊ
Feature activation+0.263
Ċ
TokenĊ
Feature activation+0.224
"
Token"
Feature activation+0.287
Imp
TokenImp
Feature activation+0.000
ossible
Tokenossible
Feature activation+0.000
!"
Token!"
Feature activation+0.487
will
Token will
Feature activation+0.012
learning
Token learning
Feature activation+0.501
what
Token what
Feature activation+0.385
role
Token role
Feature activation+0.398
the
Token the
Feature activation+0.378
White
Token White
Feature activation+0.150
House
Token House
Feature activation+0.237
may
Token may
Feature activation+0.258
have
Token have
Feature activation+0.305
played
Token played
Feature activation+0.243
in
Token in
Feature activation+0.238
decisions
Token decisions
Feature activation+0.359
sound
Token sound
Feature activation+0.486
too
Token too
Feature activation+0.287
frivolous
Token frivolous
Feature activation+0.233
and
Token and
Feature activation+0.209
he
Token he
Feature activation+0.351
don
Tokendon
Feature activation+0.192
istic
Tokenistic
Feature activation+0.146
,
Token,
Feature activation+0.125
so
Token so
Feature activation+0.113
that
Token that
Feature activation+0.215
people
Token people
Feature activation+0.065
<|endoftext|>
Token<|endoftext|>
Feature activation+0.393
Ann
TokenAnn
Feature activation+0.438
ual
Tokenual
Feature activation+0.280
leave
Token leave
Feature activation+0.393
Ċ
TokenĊ
Feature activation+0.232
Ċ
TokenĊ
Feature activation+0.250
When
TokenWhen
Feature activation+0.343
even
Token even
Feature activation+0.336
the
Token the
Feature activation+0.288
word
Token word
Feature activation+0.416
holiday
Token holiday
Feature activation+0.420
<|endoftext|>
Token<|endoftext|>
Feature activation+0.140
âĢĵ
TokenâĢĵ
Feature activation+0.178
Ċ
TokenĊ
Feature activation+0.123
Ċ
TokenĊ
Feature activation+0.139
The
TokenThe
Feature activation+0.200
St
Token St
Feature activation+0.216
on
Tokenon
Feature activation+0.300
em
Tokenem
Feature activation+0.210
asters
Tokenasters
Feature activation+0.092
:
Token:
Feature activation+0.213
California
Token California
Feature activation+0.179

INTERVAL 0.000 - 0.146
CONTAINS 99.966%

the
Token the
Feature activation+0.000
niece
Token niece
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
"
Token"
Feature activation+0.000
That
TokenThat
Feature activation+0.000
's
Token's
Feature activation+0.000
one
Token one
Feature activation+0.000
thing
Token thing
Feature activation+0.000
I
Token I
Feature activation+0.000
like
Token like
Feature activation+0.000
Virginia
Token Virginia
Feature activation+0.000
State
Token State
Feature activation+0.000
Police
Token Police
Feature activation+0.000
said
Token said
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
officers
Token officers
Feature activation+0.000
were
Token were
Feature activation+0.000
shot
Token shot
Feature activation+0.000
increase
Token increase
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
concentration
Token concentration
Feature activation+0.000
since
Token since
Feature activation+0.000
1850
Token 1850
Feature activation+0.000
finds
Token finds
Feature activation+0.000
its
Token its
Feature activation+0.000
natural
Token natural
Feature activation+0.000
explanation
Token explanation
Feature activation+0.000
in
Token in
Feature activation+0.000
intake
Token intake
Feature activation+0.000
to
Token to
Feature activation+0.000
create
Token create
Feature activation+0.000
billions
Token billions
Feature activation+0.000
of
Token of
Feature activation+0.000
potentially
Token potentially
Feature activation+0.000
toxic
Token toxic
Feature activation+0.000
m
Token m
Feature activation+0.000
ixtures
Tokenixtures
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
the
Token the
Feature activation+0.000
best
Token best
Feature activation+0.000
preparation
Token preparation
Feature activation+0.000
for
Token for
Feature activation+0.000
graduate
Token graduate
Feature activation+0.000
school
Token school
Feature activation+0.000
you
Token you
Feature activation+0.000
could
Token could
Feature activation+0.000
get
Token get
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 3 in H1.8: (feature 10931

TOP ACTIVATIONS
MAX = 1.368

billionaire
Token billionaire
Feature activation+1.014
Mark
Token Mark
Feature activation+0.909
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
a
Token a
Feature activation+0.930
bone
Token bone
Feature activation+1.368
to
Token to
Feature activation+0.884
pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
a
Token a
Feature activation+0.930
bone
Token bone
Feature activation+1.368
to
Token to
Feature activation+0.884
pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
Exchange
Token Exchange
Feature activation+1.293
Commission
Token Commission
Feature activation+1.078
.
Token.
Feature activation+0.637
Less
Token Less
Feature activation+0.702
than
Token than
Feature activation+0.687
a
Token a
Feature activation+0.611
two
Token two
Feature activation+0.691
years
Token years
Feature activation+0.784
ago
Token ago
Feature activation+0.592
,
Token,
Feature activation+0.633
the
Token the
Feature activation+0.600
IRS
Token IRS
Feature activation+1.239
used
Token used
Feature activation+0.755
a
Token a
Feature activation+0.590
controversial
Token controversial
Feature activation+0.925
policy
Token policy
Feature activation+1.051
known
Token known
Feature activation+0.712
6
Token 6
Feature activation+0.982
s
Tokens
Feature activation+0.811
having
Token having
Feature activation+0.618
a
Token a
Feature activation+0.548
smaller
Token smaller
Feature activation+0.819
battery
Token battery
Feature activation+1.164
than
Token than
Feature activation+0.344
its
Token its
Feature activation+0.587
predecessor
Token predecessor
Feature activation+1.033
emerged
Token emerged
Feature activation+0.542
about
Token about
Feature activation+0.307
irsch
Tokenirsch
Feature activation+0.947
.
Token.
Feature activation+0.783
Courtesy
Token Courtesy
Feature activation+1.000
Institute
Token Institute
Feature activation+0.969
for
Token for
Feature activation+0.755
Justice
Token Justice
Feature activation+1.139
More
Token More
Feature activation+0.878
than
Token than
Feature activation+0.851
two
Token two
Feature activation+0.691
years
Token years
Feature activation+0.784
ago
Token ago
Feature activation+0.592
a
Token a
Feature activation+0.590
controversial
Token controversial
Feature activation+0.925
policy
Token policy
Feature activation+1.051
known
Token known
Feature activation+0.712
as
Token as
Feature activation+0.507
civil
Token civil
Feature activation+1.098
forfeiture
Token forfeiture
Feature activation+0.992
to
Token to
Feature activation+0.499
empty
Token empty
Feature activation+0.670
the
Token the
Feature activation+0.472
bank
Token bank
Feature activation+0.957
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
Exchange
Token Exchange
Feature activation+1.293
Commission
Token Commission
Feature activation+1.078
.
Token.
Feature activation+0.637
Less
Token Less
Feature activation+0.702
than
Token than
Feature activation+0.687
a
Token a
Feature activation+0.611
year
Token year
Feature activation+0.854
75
Token75
Feature activation+0.000
million
Tokenmillion
Feature activation+0.000
move
Token move
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.490
Out
TokenOut
Feature activation+1.070
spoken
Tokenspoken
Feature activation+0.896
billionaire
Token billionaire
Feature activation+1.014
Mark
Token Mark
Feature activation+0.909
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
forfeiture
Token forfeiture
Feature activation+0.992
to
Token to
Feature activation+0.499
empty
Token empty
Feature activation+0.670
the
Token the
Feature activation+0.472
bank
Token bank
Feature activation+0.957
account
Token account
Feature activation+1.056
of
Token of
Feature activation+0.440
a
Token a
Feature activation+0.472
small
Token small
Feature activation+0.677
business
Token business
Feature activation+0.857
owned
Token owned
Feature activation+0.722
the
Token the
Feature activation+0.600
IRS
Token IRS
Feature activation+1.239
used
Token used
Feature activation+0.755
a
Token a
Feature activation+0.590
controversial
Token controversial
Feature activation+0.925
policy
Token policy
Feature activation+1.051
known
Token known
Feature activation+0.712
as
Token as
Feature activation+0.507
civil
Token civil
Feature activation+1.098
forfeiture
Token forfeiture
Feature activation+0.992
to
Token to
Feature activation+0.499
a
Token a
Feature activation+0.548
smaller
Token smaller
Feature activation+0.819
battery
Token battery
Feature activation+1.164
than
Token than
Feature activation+0.344
its
Token its
Feature activation+0.587
predecessor
Token predecessor
Feature activation+1.033
emerged
Token emerged
Feature activation+0.542
about
Token about
Feature activation+0.307
a
Token a
Feature activation+0.395
month
Token month
Feature activation+0.582
ago
Token ago
Feature activation+0.268
Less
Token Less
Feature activation+0.702
than
Token than
Feature activation+0.687
a
Token a
Feature activation+0.611
year
Token year
Feature activation+0.854
after
Token after
Feature activation+0.555
scoring
Token scoring
Feature activation+1.022
a
Token a
Feature activation+0.603
huge
Token huge
Feature activation+0.553
victory
Token victory
Feature activation+0.758
against
Token against
Feature activation+0.533
the
Token the
Feature activation+0.469
move
Token move
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.490
Out
TokenOut
Feature activation+1.070
spoken
Tokenspoken
Feature activation+0.896
billionaire
Token billionaire
Feature activation+1.014
Mark
Token Mark
Feature activation+0.909
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
a
Token a
Feature activation+0.930
bone
Token bone
Feature activation+1.368
to
Token to
Feature activation+0.884
pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
Exchange
Token Exchange
Feature activation+1.293
Commission
Token Commission
Feature activation+1.078
.
Token.
Feature activation+0.637
Less
Token Less
Feature activation+0.702
<|endoftext|>
Token<|endoftext|>
Feature activation+0.127
While
TokenWhile
Feature activation+0.412
rumors
Token rumors
Feature activation+0.843
of
Token of
Feature activation+0.535
the
Token the
Feature activation+0.576
iPhone
Token iPhone
Feature activation+1.007
6
Token 6
Feature activation+0.982
s
Tokens
Feature activation+0.811
having
Token having
Feature activation+0.618
a
Token a
Feature activation+0.548
smaller
Token smaller
Feature activation+0.819
<|endoftext|>
Token<|endoftext|>
Feature activation+0.620
Jeff
TokenJeff
Feature activation+0.845
H
Token H
Feature activation+0.981
irsch
Tokenirsch
Feature activation+0.947
.
Token.
Feature activation+0.783
Courtesy
Token Courtesy
Feature activation+1.000
Institute
Token Institute
Feature activation+0.969
for
Token for
Feature activation+0.755
Justice
Token Justice
Feature activation+1.139
More
Token More
Feature activation+0.878
than
Token than
Feature activation+0.851
for
Token for
Feature activation+0.210
India
Token India
Feature activation+0.672
âĢ
TokenâĢ
Feature activation+0.338
Ļ
TokenĻ
Feature activation+0.434
s
Tokens
Feature activation+0.402
nuclear
Token nuclear
Feature activation+0.994
ambitions
Token ambitions
Feature activation+0.722
7
Token 7
Feature activation+0.671
April
Token April
Feature activation+0.460
2017
Token 2017
Feature activation+0.602
Ċ
TokenĊ
Feature activation+0.188
controversial
Token controversial
Feature activation+0.925
policy
Token policy
Feature activation+1.051
known
Token known
Feature activation+0.712
as
Token as
Feature activation+0.507
civil
Token civil
Feature activation+1.098
forfeiture
Token forfeiture
Feature activation+0.992
to
Token to
Feature activation+0.499
empty
Token empty
Feature activation+0.670
the
Token the
Feature activation+0.472
bank
Token bank
Feature activation+0.957
account
Token account
Feature activation+1.056
's
Token's
Feature activation+0.000
Old
Token Old
Feature activation+0.000
Boys
Token Boys
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.526
Ryan
TokenRyan
Feature activation+0.991
Mat
Token Mat
Feature activation+0.876
hews
Tokenhews
Feature activation+0.904
is
Token is
Feature activation+0.596
a
Token a
Feature activation+0.670
bit
Token bit
Feature activation+0.924

Top DFA by src position
MAX = 1.283

£
Token £
Feature activation+0.090
Top resid features:
75
Token75
Feature activation+0.035
Top resid features:
million
Tokenmillion
Feature activation+0.090
Top resid features:
move
Token move
Feature activation+0.072
Top resid features:
.
Token.
Feature activation+0.056
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.467
Top resid features:
Out
TokenOut
Feature activation+0.117
Top resid features:
spoken
Tokenspoken
Feature activation+0.264
Top resid features:
billionaire
Token billionaire
Feature activation+0.122
Top resid features:
Mark
Token Mark
Feature activation+0.145
Top resid features:
Cuban
Token Cuban
Feature activation+0.277
Top resid features:
£
Token £
Feature activation+0.131
Top resid features:
75
Token75
Feature activation+0.032
Top resid features:
million
Tokenmillion
Feature activation+0.088
Top resid features:
move
Token move
Feature activation+0.072
Top resid features:
.
Token.
Feature activation+0.042
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.361
Top resid features:
Out
TokenOut
Feature activation+0.099
Top resid features:
spoken
Tokenspoken
Feature activation+0.168
Top resid features:
billionaire
Token billionaire
Feature activation+0.087
Top resid features:
Mark
Token Mark
Feature activation+0.126
Top resid features:
Cuban
Token Cuban
Feature activation+0.247
Top resid features:
to
Token to
Feature activation+0.089
Top resid features:
pick
Token pick
Feature activation+0.125
Top resid features:
with
Token with
Feature activation+0.127
Top resid features:
the
Token the
Feature activation+0.078
Top resid features:
US
Token US
Feature activation+0.110
Top resid features:
Securities
Token Securities
Feature activation+0.276
Top resid features:
and
Token and
Feature activation+0.166
Top resid features:
Exchange
Token Exchange
Feature activation+0.266
Top resid features:
Commission
Token Commission
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
Less
Token Less
Feature activation+0.000
Top resid features:
ed
Tokened
Feature activation+0.021
Top resid features:
iy
Tokeniy
Feature activation+0.021
Top resid features:
es
Tokenes
Feature activation+0.037
Top resid features:
por
Tokenpor
Feature activation+0.041
Top resid features:
.
Token.
Feature activation+0.037
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.297
Top resid features:
Jeff
TokenJeff
Feature activation+0.065
Top resid features:
H
Token H
Feature activation+0.074
Top resid features:
irsch
Tokenirsch
Feature activation+0.108
Top resid features:
.
Token.
Feature activation+0.090
Top resid features:
Courtesy
Token Courtesy
Feature activation+0.120
Top resid features:
and
Token and
Feature activation+0.034
Top resid features:
selling
Token selling
Feature activation+0.046
Top resid features:
talent
Token talent
Feature activation+0.041
Top resid features:
abroad
Token abroad
Feature activation+0.045
Top resid features:
.
Token.
Feature activation+0.043
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.393
Top resid features:
While
TokenWhile
Feature activation+0.133
Top resid features:
rumors
Token rumors
Feature activation+0.283
Top resid features:
of
Token of
Feature activation+0.124
Top resid features:
the
Token the
Feature activation+0.090
Top resid features:
iPhone
Token iPhone
Feature activation+0.222
Top resid features:
ed
Tokened
Feature activation+0.040
Top resid features:
iy
Tokeniy
Feature activation+0.036
Top resid features:
es
Tokenes
Feature activation+0.059
Top resid features:
por
Tokenpor
Feature activation+0.047
Top resid features:
.
Token.
Feature activation+0.051
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.518
Top resid features:
Jeff
TokenJeff
Feature activation+0.110
Top resid features:
H
Token H
Feature activation+0.128
Top resid features:
irsch
Tokenirsch
Feature activation+0.198
Top resid features:
.
Token.
Feature activation+0.179
Top resid features:
Courtesy
Token Courtesy
Feature activation+0.253
Top resid features:
a
Token a
Feature activation+0.103
Top resid features:
controversial
Token controversial
Feature activation+0.121
Top resid features:
policy
Token policy
Feature activation+0.144
Top resid features:
known
Token known
Feature activation+0.179
Top resid features:
as
Token as
Feature activation+0.164
Top resid features:
civil
Token civil
Feature activation+0.219
Top resid features:
forfeiture
Token forfeiture
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
empty
Token empty
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
bank
Token bank
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.078
Top resid features:
US
Token US
Feature activation+0.073
Top resid features:
Securities
Token Securities
Feature activation+0.231
Top resid features:
and
Token and
Feature activation+0.147
Top resid features:
Exchange
Token Exchange
Feature activation+0.153
Top resid features:
Commission
Token Commission
Feature activation+0.272
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
Less
Token Less
Feature activation+0.000
Top resid features:
than
Token than
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
year
Token year
Feature activation+0.000
Top resid features:
£
Token £
Feature activation+0.093
Top resid features:
75
Token75
Feature activation+0.056
Top resid features:
million
Tokenmillion
Feature activation+0.096
Top resid features:
move
Token move
Feature activation+0.075
Top resid features:
.
Token.
Feature activation+0.051
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.194
Top resid features:
Out
TokenOut
Feature activation+0.332
Top resid features:
spoken
Tokenspoken
Feature activation+0.000
Top resid features:
billionaire
Token billionaire
Feature activation+0.000
Top resid features:
Mark
Token Mark
Feature activation+0.000
Top resid features:
Cuban
Token Cuban
Feature activation+0.000
Top resid features:
civil
Token civil
Feature activation+0.057
Top resid features:
forfeiture
Token forfeiture
Feature activation+0.224
Top resid features:
to
Token to
Feature activation+0.090
Top resid features:
empty
Token empty
Feature activation+0.110
Top resid features:
the
Token the
Feature activation+0.080
Top resid features:
bank
Token bank
Feature activation+0.278
Top resid features:
account
Token account
Feature activation+0.174
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
small
Token small
Feature activation+0.000
Top resid features:
business
Token business
Feature activation+0.000
Top resid features:
two
Token two
Feature activation+0.084
Top resid features:
years
Token years
Feature activation+0.101
Top resid features:
ago
Token ago
Feature activation+0.098
Top resid features:
,
Token,
Feature activation+0.105
Top resid features:
the
Token the
Feature activation+0.091
Top resid features:
IRS
Token IRS
Feature activation+0.311
Top resid features:
used
Token used
Feature activation+0.157
Top resid features:
a
Token a
Feature activation+0.140
Top resid features:
controversial
Token controversial
Feature activation+0.149
Top resid features:
policy
Token policy
Feature activation+0.243
Top resid features:
known
Token known
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.028
Top resid features:
selling
Token selling
Feature activation+0.045
Top resid features:
talent
Token talent
Feature activation+0.038
Top resid features:
abroad
Token abroad
Feature activation+0.041
Top resid features:
.
Token.
Feature activation+0.035
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.307
Top resid features:
While
TokenWhile
Feature activation+0.093
Top resid features:
rumors
Token rumors
Feature activation+0.174
Top resid features:
of
Token of
Feature activation+0.083
Top resid features:
the
Token the
Feature activation+0.069
Top resid features:
iPhone
Token iPhone
Feature activation+0.211
Top resid features:
Luk
Token Luk
Feature activation+0.033
Top resid features:
aku
Tokenaku
Feature activation+0.080
Top resid features:
,
Token,
Feature activation+0.014
Top resid features:
who
Token who
Feature activation+0.018
Top resid features:
joined
Token joined
Feature activation+0.048
Top resid features:
Manchester
Token Manchester
Feature activation+0.236
Top resid features:
United
Token United
Feature activation+0.152
Top resid features:
in
Token in
Feature activation+0.024
Top resid features:
a
Token a
Feature activation+0.025
Top resid features:
£
Token £
Feature activation+0.105
Top resid features:
75
Token75
Feature activation+0.014
Top resid features:
£
Token £
Feature activation+0.097
Top resid features:
75
Token75
Feature activation+0.050
Top resid features:
million
Tokenmillion
Feature activation+0.090
Top resid features:
move
Token move
Feature activation+0.088
Top resid features:
.
Token.
Feature activation+0.063
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.927
Top resid features:
Out
TokenOut
Feature activation+0.216
Top resid features:
spoken
Tokenspoken
Feature activation+0.366
Top resid features:
billionaire
Token billionaire
Feature activation+0.207
Top resid features:
Mark
Token Mark
Feature activation+0.000
Top resid features:
Cuban
Token Cuban
Feature activation+0.000
Top resid features:
£
Token £
Feature activation+0.096
Top resid features:
75
Token75
Feature activation+0.025
Top resid features:
million
Tokenmillion
Feature activation+0.079
Top resid features:
move
Token move
Feature activation+0.068
Top resid features:
.
Token.
Feature activation+0.044
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.337
Top resid features:
Out
TokenOut
Feature activation+0.076
Top resid features:
spoken
Tokenspoken
Feature activation+0.128
Top resid features:
billionaire
Token billionaire
Feature activation+0.056
Top resid features:
Mark
Token Mark
Feature activation+0.103
Top resid features:
Cuban
Token Cuban
Feature activation+0.203
Top resid features:
and
Token and
Feature activation+0.045
Top resid features:
selling
Token selling
Feature activation+0.059
Top resid features:
talent
Token talent
Feature activation+0.053
Top resid features:
abroad
Token abroad
Feature activation+0.047
Top resid features:
.
Token.
Feature activation+0.057
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.537
Top resid features:
While
TokenWhile
Feature activation+0.202
Top resid features:
rumors
Token rumors
Feature activation+0.425
Top resid features:
of
Token of
Feature activation+0.182
Top resid features:
the
Token the
Feature activation+0.170
Top resid features:
iPhone
Token iPhone
Feature activation+0.355
Top resid features:
ed
Tokened
Feature activation+0.045
Top resid features:
iy
Tokeniy
Feature activation+0.043
Top resid features:
es
Tokenes
Feature activation+0.056
Top resid features:
por
Tokenpor
Feature activation+0.048
Top resid features:
.
Token.
Feature activation+0.044
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.742
Top resid features:
Jeff
TokenJeff
Feature activation+0.190
Top resid features:
H
Token H
Feature activation+0.183
Top resid features:
irsch
Tokenirsch
Feature activation+0.308
Top resid features:
.
Token.
Feature activation+0.243
Top resid features:
Courtesy
Token Courtesy
Feature activation+0.340
Top resid features:
1
Token1
Feature activation+0.038
Top resid features:
Liverpool
Token Liverpool
Feature activation+0.088
Top resid features:
Drew
Token Drew
Feature activation+0.081
Top resid features:
Man
Token Man
Feature activation+0.031
Top resid features:
Ut
Token Ut
Feature activation+0.043
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.668
Top resid features:
Fuel
TokenFuel
Feature activation+0.017
Top resid features:
for
Token for
Feature activation+0.195
Top resid features:
India
Token India
Feature activation+0.258
Top resid features:
âĢ
TokenâĢ
Feature activation+0.159
Top resid features:
Ļ
TokenĻ
Feature activation+0.142
Top resid features:
controversial
Token controversial
Feature activation+0.098
Top resid features:
policy
Token policy
Feature activation+0.110
Top resid features:
known
Token known
Feature activation+0.145
Top resid features:
as
Token as
Feature activation+0.129
Top resid features:
civil
Token civil
Feature activation+0.047
Top resid features:
forfeiture
Token forfeiture
Feature activation+0.366
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
empty
Token empty
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
bank
Token bank
Feature activation+0.000
Top resid features:
account
Token account
Feature activation+0.000
Top resid features:
ell
Tokenell
Feature activation+0.077
Top resid features:
's
Token's
Feature activation+0.053
Top resid features:
Old
Token Old
Feature activation+0.040
Top resid features:
Boys
Token Boys
Feature activation+0.139
Top resid features:
.
Token.
Feature activation+0.066
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.283
Top resid features:
Ryan
TokenRyan
Feature activation+0.291
Top resid features:
Mat
Token Mat
Feature activation+0.000
Top resid features:
hews
Tokenhews
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.10

Head 2: 0.07

Head 3: 0.04

Head 4: 0.07

Head 5: 0.07

Head 6: 0.05

Head 7: 0.11

Head 8: 0.18

Head 9: 0.11

Head 10: 0.06

Head 11: 0.10

Positive logits

CLOSE1.26

ormon1.16

ABC1.13

Patreon1.11

Characters1.06

Anonymous1.06

Attorney1.05

Feminist1.05

Confederate1.04

umbn1.03

anim1.03

anal1.02

Bethesda1.02

FBI1.02

File1.01

Null1.01

MSM1.00

kat1.00

Navajo1.00

Ep1.00

Negative logits

relegation-1.27

nerv-1.25

Wembley-1.23

Dortmund-1.19

landsl-1.14

Mata-1.14

clinch-1.13

unbeaten-1.12

slump-1.11

goalkeeper-1.11

Guardiola-1.10

defences-1.09

agre-1.09

analyse-1.09

outings-1.09

strength-1.09

Ake-1.09

stride-1.08

territ-1.08

rounds-1.08

INTERVAL 1.231 - 1.368
CONTAINS 0.000%

two
Token two
Feature activation+0.691
years
Token years
Feature activation+0.784
ago
Token ago
Feature activation+0.592
,
Token,
Feature activation+0.633
the
Token the
Feature activation+0.600
IRS
Token IRS
Feature activation+1.239
used
Token used
Feature activation+0.755
a
Token a
Feature activation+0.590
controversial
Token controversial
Feature activation+0.925
policy
Token policy
Feature activation+1.051
known
Token known
Feature activation+0.712
billionaire
Token billionaire
Feature activation+1.014
Mark
Token Mark
Feature activation+0.909
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
a
Token a
Feature activation+0.930
bone
Token bone
Feature activation+1.368
to
Token to
Feature activation+0.884
pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
a
Token a
Feature activation+0.930
bone
Token bone
Feature activation+1.368
to
Token to
Feature activation+0.884
pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
Exchange
Token Exchange
Feature activation+1.293
Commission
Token Commission
Feature activation+1.078
.
Token.
Feature activation+0.637
Less
Token Less
Feature activation+0.702
than
Token than
Feature activation+0.687
a
Token a
Feature activation+0.611

INTERVAL 1.095 - 1.231
CONTAINS 0.000%

6
Token 6
Feature activation+0.982
s
Tokens
Feature activation+0.811
having
Token having
Feature activation+0.618
a
Token a
Feature activation+0.548
smaller
Token smaller
Feature activation+0.819
battery
Token battery
Feature activation+1.164
than
Token than
Feature activation+0.344
its
Token its
Feature activation+0.587
predecessor
Token predecessor
Feature activation+1.033
emerged
Token emerged
Feature activation+0.542
about
Token about
Feature activation+0.307
irsch
Tokenirsch
Feature activation+0.947
.
Token.
Feature activation+0.783
Courtesy
Token Courtesy
Feature activation+1.000
Institute
Token Institute
Feature activation+0.969
for
Token for
Feature activation+0.755
Justice
Token Justice
Feature activation+1.139
More
Token More
Feature activation+0.878
than
Token than
Feature activation+0.851
two
Token two
Feature activation+0.691
years
Token years
Feature activation+0.784
ago
Token ago
Feature activation+0.592
a
Token a
Feature activation+0.590
controversial
Token controversial
Feature activation+0.925
policy
Token policy
Feature activation+1.051
known
Token known
Feature activation+0.712
as
Token as
Feature activation+0.507
civil
Token civil
Feature activation+1.098
forfeiture
Token forfeiture
Feature activation+0.992
to
Token to
Feature activation+0.499
empty
Token empty
Feature activation+0.670
the
Token the
Feature activation+0.472
bank
Token bank
Feature activation+0.957

INTERVAL 0.958 - 1.095
CONTAINS 0.002%

es
Tokenes
Feature activation+0.000
por
Tokenpor
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.620
Jeff
TokenJeff
Feature activation+0.845
H
Token H
Feature activation+0.981
irsch
Tokenirsch
Feature activation+0.947
.
Token.
Feature activation+0.783
Courtesy
Token Courtesy
Feature activation+1.000
Institute
Token Institute
Feature activation+0.969
for
Token for
Feature activation+0.755
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
Exchange
Token Exchange
Feature activation+1.293
Commission
Token Commission
Feature activation+1.078
.
Token.
Feature activation+0.637
Less
Token Less
Feature activation+0.702
than
Token than
Feature activation+0.687
a
Token a
Feature activation+0.611
year
Token year
Feature activation+0.854
forfeiture
Token forfeiture
Feature activation+0.992
to
Token to
Feature activation+0.499
empty
Token empty
Feature activation+0.670
the
Token the
Feature activation+0.472
bank
Token bank
Feature activation+0.957
account
Token account
Feature activation+1.056
of
Token of
Feature activation+0.440
a
Token a
Feature activation+0.472
small
Token small
Feature activation+0.677
business
Token business
Feature activation+0.857
owned
Token owned
Feature activation+0.722
Jeff
TokenJeff
Feature activation+0.845
H
Token H
Feature activation+0.981
irsch
Tokenirsch
Feature activation+0.947
.
Token.
Feature activation+0.783
Courtesy
Token Courtesy
Feature activation+1.000
Institute
Token Institute
Feature activation+0.969
for
Token for
Feature activation+0.755
Justice
Token Justice
Feature activation+1.139
More
Token More
Feature activation+0.878
than
Token than
Feature activation+0.851
two
Token two
Feature activation+0.691
to
Token to
Feature activation+0.884
pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
Exchange
Token Exchange
Feature activation+1.293
Commission
Token Commission
Feature activation+1.078
.
Token.
Feature activation+0.637
Less
Token Less
Feature activation+0.702

INTERVAL 0.821 - 0.958
CONTAINS 0.003%

Courtesy
Token Courtesy
Feature activation+1.000
Institute
Token Institute
Feature activation+0.969
for
Token for
Feature activation+0.755
Justice
Token Justice
Feature activation+1.139
More
Token More
Feature activation+0.878
than
Token than
Feature activation+0.851
two
Token two
Feature activation+0.691
years
Token years
Feature activation+0.784
ago
Token ago
Feature activation+0.592
,
Token,
Feature activation+0.633
the
Token the
Feature activation+0.600
Boys
Token Boys
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.526
Ryan
TokenRyan
Feature activation+0.991
Mat
Token Mat
Feature activation+0.876
hews
Tokenhews
Feature activation+0.904
is
Token is
Feature activation+0.596
a
Token a
Feature activation+0.670
bit
Token bit
Feature activation+0.924
of
Token of
Feature activation+0.630
a
Token a
Feature activation+0.696
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.490
Out
TokenOut
Feature activation+1.070
spoken
Tokenspoken
Feature activation+0.896
billionaire
Token billionaire
Feature activation+1.014
Mark
Token Mark
Feature activation+0.909
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
a
Token a
Feature activation+0.930
bone
Token bone
Feature activation+1.368
to
Token to
Feature activation+0.884
Old
Token Old
Feature activation+0.000
Boys
Token Boys
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.526
Ryan
TokenRyan
Feature activation+0.991
Mat
Token Mat
Feature activation+0.876
hews
Tokenhews
Feature activation+0.904
is
Token is
Feature activation+0.596
a
Token a
Feature activation+0.670
bit
Token bit
Feature activation+0.924
of
Token of
Feature activation+0.630
spoken
Tokenspoken
Feature activation+0.896
billionaire
Token billionaire
Feature activation+1.014
Mark
Token Mark
Feature activation+0.909
Cuban
Token Cuban
Feature activation+0.905
has
Token has
Feature activation+0.920
a
Token a
Feature activation+0.930
bone
Token bone
Feature activation+1.368
to
Token to
Feature activation+0.884
pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839

INTERVAL 0.684 - 0.821
CONTAINS 0.003%

pick
Token pick
Feature activation+1.324
with
Token with
Feature activation+0.781
the
Token the
Feature activation+0.839
US
Token US
Feature activation+0.917
Securities
Token Securities
Feature activation+1.011
and
Token and
Feature activation+0.701
Exchange
Token Exchange
Feature activation+1.293
Commission
Token Commission
Feature activation+1.078
.
Token.
Feature activation+0.637
Less
Token Less
Feature activation+0.702
than
Token than
Feature activation+0.687
no
Token no
Feature activation+0.337
mention
Token mention
Feature activation+0.226
of
Token of
Feature activation+0.152
the
Token the
Feature activation+0.188
new
Token new
Feature activation+0.289
device
Token device
Feature activation+0.693
âĢ
TokenâĢ
Feature activation+0.261
Ļ
TokenĻ
Feature activation+0.283
s
Tokens
Feature activation+0.245
exact
Token exact
Feature activation+0.288
power
Token power
Feature activation+0.808
be
Token be
Feature activation+0.581
a
Token a
Feature activation+0.547
top
Token top
Feature activation+0.630
-
Token-
Feature activation+0.508
five
Tokenfive
Feature activation+0.570
NFL
Token NFL
Feature activation+0.798
running
Token running
Feature activation+0.790
back
Token back
Feature activation+0.535
,
Token,
Feature activation+0.248
but
Token but
Feature activation+0.282
injury
Token injury
Feature activation+0.679
âĢ
TokenâĢ
Feature activation+0.261
Ļ
TokenĻ
Feature activation+0.283
s
Tokens
Feature activation+0.245
exact
Token exact
Feature activation+0.288
power
Token power
Feature activation+0.808
capacity
Token capacity
Feature activation+0.738
.
Token.
Feature activation+0.177
Ċ
TokenĊ
Feature activation+0.111
Ċ
TokenĊ
Feature activation+0.002
However
TokenHowever
Feature activation+0.169
,
Token,
Feature activation+0.000
iPhone
Token iPhone
Feature activation+1.007
6
Token 6
Feature activation+0.982
s
Tokens
Feature activation+0.811
having
Token having
Feature activation+0.618
a
Token a
Feature activation+0.548
smaller
Token smaller
Feature activation+0.819
battery
Token battery
Feature activation+1.164
than
Token than
Feature activation+0.344
its
Token its
Feature activation+0.587
predecessor
Token predecessor
Feature activation+1.033
emerged
Token emerged
Feature activation+0.542

INTERVAL 0.547 - 0.684
CONTAINS 0.006%

Prosecutors
TokenProsecutors
Feature activation+0.608
often
Token often
Feature activation+0.589
understand
Token understand
Feature activation+0.486
what
Token what
Feature activation+0.475
âĢ
TokenâĢ
Feature activation+0.541
Ļ
TokenĻ
Feature activation+0.664
s
Tokens
Feature activation+0.524
going
Token going
Feature activation+0.623
on
Token on
Feature activation+0.435
but
Token but
Feature activation+0.478
threaten
Token threaten
Feature activation+0.688
looks
Token looks
Feature activation+0.808
like
Token like
Feature activation+0.858
he
Token he
Feature activation+0.701
could
Token could
Feature activation+0.579
be
Token be
Feature activation+0.581
a
Token a
Feature activation+0.547
top
Token top
Feature activation+0.630
-
Token-
Feature activation+0.508
five
Tokenfive
Feature activation+0.570
NFL
Token NFL
Feature activation+0.798
running
Token running
Feature activation+0.790
av
Tokenav
Feature activation+0.250
J
Token J
Feature activation+0.403
ha
Tokenha
Feature activation+0.478
gives
Token gives
Feature activation+0.259
an
Token an
Feature activation+0.289
update
Token update
Feature activation+0.581
on
Token on
Feature activation+0.362
India
Token India
Feature activation+0.520
âĢ
TokenâĢ
Feature activation+0.189
Ļ
TokenĻ
Feature activation+0.259
s
Tokens
Feature activation+0.287
video
Token video
Feature activation+0.391
that
Token that
Feature activation+0.314
features
Token features
Feature activation+0.453
the
Token the
Feature activation+0.333
legitimate
Token legitimate
Feature activation+0.501
Man
Token Man
Feature activation+0.553
b
Tokenb
Feature activation+0.405
ij
Tokenij
Feature activation+0.231
Revolutionary
Token Revolutionary
Feature activation+0.435
Local
Token Local
Feature activation+0.424
Council
Token Council
Feature activation+0.447
having
Token having
Feature activation+0.618
a
Token a
Feature activation+0.548
smaller
Token smaller
Feature activation+0.819
battery
Token battery
Feature activation+1.164
than
Token than
Feature activation+0.344
its
Token its
Feature activation+0.587
predecessor
Token predecessor
Feature activation+1.033
emerged
Token emerged
Feature activation+0.542
about
Token about
Feature activation+0.307
a
Token a
Feature activation+0.395
month
Token month
Feature activation+0.582

INTERVAL 0.410 - 0.547
CONTAINS 0.009%

This
TokenThis
Feature activation+0.458
article
Token article
Feature activation+0.440
is
Token is
Feature activation+0.306
over
Token over
Feature activation+0.392
3
Token 3
Feature activation+0.485
years
Token years
Feature activation+0.463
old
Token old
Feature activation+0.478
Ċ
TokenĊ
Feature activation+0.261
Ċ
TokenĊ
Feature activation+0.176
Police
TokenPolice
Feature activation+0.551
in
Token in
Feature activation+0.293
with
Token with
Feature activation+0.229
english
Token english
Feature activation+0.376
subtitles
Token subtitles
Feature activation+0.463
announcing
Token announcing
Feature activation+0.317
the
Token the
Feature activation+0.156
creation
Token creation
Feature activation+0.449
of
Token of
Feature activation+0.043
a
Token a
Feature activation+0.124
FSA
Token FSA
Feature activation+0.513
Man
Token Man
Feature activation+0.325
b
Tokenb
Feature activation+0.122
scoring
Token scoring
Feature activation+1.022
a
Token a
Feature activation+0.603
huge
Token huge
Feature activation+0.553
victory
Token victory
Feature activation+0.758
against
Token against
Feature activation+0.533
the
Token the
Feature activation+0.469
regulator
Token regulator
Feature activation+0.828
,
Token,
Feature activation+0.312
he
Token he
Feature activation+0.501
took
Token took
Feature activation+0.243
to
Token to
Feature activation+0.267
legitimate
Token legitimate
Feature activation+0.501
Man
Token Man
Feature activation+0.553
b
Tokenb
Feature activation+0.405
ij
Tokenij
Feature activation+0.231
Revolutionary
Token Revolutionary
Feature activation+0.435
Local
Token Local
Feature activation+0.424
Council
Token Council
Feature activation+0.447
with
Token with
Feature activation+0.229
english
Token english
Feature activation+0.376
subtitles
Token subtitles
Feature activation+0.463
announcing
Token announcing
Feature activation+0.317
plans
Token plans
Feature activation+0.444
to
Token to
Feature activation+0.167
use
Token use
Feature activation+0.154
its
Token its
Feature activation+0.167
abundant
Token abundant
Feature activation+0.219
thor
Token thor
Feature activation+0.476
ium
Tokenium
Feature activation+0.305
resources
Token resources
Feature activation+0.169
for
Token for
Feature activation+0.000
nuclear
Token nuclear
Feature activation+0.373
power
Token power
Feature activation+0.373

INTERVAL 0.274 - 0.410
CONTAINS 0.010%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.045
This
TokenThis
Feature activation+0.224
is
Token is
Feature activation+0.150
a
Token a
Feature activation+0.266
video
Token video
Feature activation+0.391
that
Token that
Feature activation+0.314
features
Token features
Feature activation+0.453
the
Token the
Feature activation+0.333
legitimate
Token legitimate
Feature activation+0.501
Man
Token Man
Feature activation+0.553
b
Tokenb
Feature activation+0.405
made
Token made
Feature activation+0.432
no
Token no
Feature activation+0.337
mention
Token mention
Feature activation+0.226
of
Token of
Feature activation+0.152
the
Token the
Feature activation+0.188
new
Token new
Feature activation+0.289
device
Token device
Feature activation+0.693
âĢ
TokenâĢ
Feature activation+0.261
Ļ
TokenĻ
Feature activation+0.283
s
Tokens
Feature activation+0.245
exact
Token exact
Feature activation+0.288
ium
Tokenium
Feature activation+0.305
resources
Token resources
Feature activation+0.169
for
Token for
Feature activation+0.000
nuclear
Token nuclear
Feature activation+0.373
power
Token power
Feature activation+0.373
production
Token production
Feature activation+0.336
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
End
TokenEnd
Feature activation+0.000
owed
Tokenowed
Feature activation+0.000
acknowledged
Token acknowledged
Feature activation+0.334
as
Token as
Feature activation+0.175
one
Token one
Feature activation+0.259
of
Token of
Feature activation+0.192
the
Token the
Feature activation+0.238
master
Token master
Feature activation+0.372
pieces
Tokenpieces
Feature activation+0.447
of
Token of
Feature activation+0.338
20
Token 20
Feature activation+0.382
th
Tokenth
Feature activation+0.422
-
Token-
Feature activation+0.241
s
Tokens
Feature activation+0.811
having
Token having
Feature activation+0.618
a
Token a
Feature activation+0.548
smaller
Token smaller
Feature activation+0.819
battery
Token battery
Feature activation+1.164
than
Token than
Feature activation+0.344
its
Token its
Feature activation+0.587
predecessor
Token predecessor
Feature activation+1.033
emerged
Token emerged
Feature activation+0.542
about
Token about
Feature activation+0.307
a
Token a
Feature activation+0.395

INTERVAL 0.137 - 0.274
CONTAINS 0.019%

emerged
Token emerged
Feature activation+0.542
about
Token about
Feature activation+0.307
a
Token a
Feature activation+0.395
month
Token month
Feature activation+0.582
ago
Token ago
Feature activation+0.268
,
Token,
Feature activation+0.247
Apple
Token Apple
Feature activation+0.593
today
Token today
Feature activation+0.348
made
Token made
Feature activation+0.432
no
Token no
Feature activation+0.337
mention
Token mention
Feature activation+0.226
first
Token first
Feature activation+0.350
Super
Token Super
Feature activation+0.280
Tuesday
Token Tuesday
Feature activation+0.198
âĢĶ
Token âĢĶ
Feature activation+0.110
when
Token when
Feature activation+0.140
Democrats
Token Democrats
Feature activation+0.184
in
Token in
Feature activation+0.138
11
Token 11
Feature activation+0.177
states
Token states
Feature activation+0.335
will
Token will
Feature activation+0.109
weigh
Token weigh
Feature activation+0.067
Hos
Token Hos
Feature activation+0.000
kins
Tokenkins
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
The
TokenThe
Feature activation+0.239
Lich
Token Lich
Feature activation+0.217
King
Token King
Feature activation+0.262
commands
Token commands
Feature activation+0.276
you
Token you
Feature activation+0.268
Ċ
TokenĊ
Feature activation+0.186
Ċ
TokenĊ
Feature activation+0.071
Looking
TokenLooking
Feature activation+0.115
The
TokenThe
Feature activation+0.299
Sound
Token Sound
Feature activation+0.439
And
Token And
Feature activation+0.402
The
Token The
Feature activation+0.228
Fury
Token Fury
Feature activation+0.431
âĢ
TokenâĢ
Feature activation+0.147
Ŀ
TokenĿ
Feature activation+0.333
is
Token is
Feature activation+0.255
acknowledged
Token acknowledged
Feature activation+0.334
as
Token as
Feature activation+0.175
one
Token one
Feature activation+0.259
Obama
Token Obama
Feature activation+0.000
bases
Token bases
Feature activation+0.002
his
Token his
Feature activation+0.000
surrender
Token surrender
Feature activation+0.076
to
Token to
Feature activation+0.000
Iran
Token Iran
Feature activation+0.142
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
nuclear
Token nuclear
Feature activation+0.347
ambitions
Token ambitions
Feature activation+0.158

INTERVAL 0.000 - 0.137
CONTAINS 99.948%

Frank
Token Frank
Feature activation+0.000
Miller
Token Miller
Feature activation+0.000
and
Token and
Feature activation+0.000
Jack
Token Jack
Feature activation+0.000
Kirby
Token Kirby
Feature activation+0.000
as
Token as
Feature activation+0.000
their
Token their
Feature activation+0.000
major
Token major
Feature activation+0.000
artistic
Token artistic
Feature activation+0.000
influences
Token influences
Feature activation+0.000
.
Token.
Feature activation+0.000
is
Token is
Feature activation+0.000
unlike
Token unlike
Feature activation+0.000
the
Token the
Feature activation+0.000
US
Token US
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
for
Token for
Feature activation+0.000
volume
Token volume
Feature activation+0.000
alone
Token alone
Feature activation+0.000
has
Token has
Feature activation+0.000
cups
Token cups
Feature activation+0.000
euros
Token euros
Feature activation+0.000
per
Token per
Feature activation+0.000
year
Token year
Feature activation+0.000
,
Token,
Feature activation+0.000
respectively
Token respectively
Feature activation+0.000
,
Token,
Feature activation+0.000
over
Token over
Feature activation+0.000
a
Token a
Feature activation+0.000
decade
Token decade
Feature activation+0.000
by
Token by
Feature activation+0.000
lowering
Token lowering
Feature activation+0.000
,
Token,
Feature activation+0.000
then
Token then
Feature activation+0.000
the
Token the
Feature activation+0.000
wires
Token wires
Feature activation+0.000
,
Token,
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
he
Token he
Feature activation+0.000
said
Token said
Feature activation+0.000
.
Token.
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
declining
Token declining
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
In
TokenIn
Feature activation+0.000
November
Token November
Feature activation+0.000
,
Token,
Feature activation+0.000
fewer
Token fewer
Feature activation+0.000
people
Token people
Feature activation+0.000
boarded
Token boarded
Feature activation+0.000
Metro
Token Metro
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 4 in H1.8: (feature 13894

TOP ACTIVATIONS
MAX = 0.436

and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000
local
Token local
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000
local
Token local
Feature activation+0.000
women
Token women
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
unique
Token unique
Feature activation+0.000
pointer
Token pointer
Feature activation+0.000
implementation
Token implementation
Feature activation+0.000
that
Token that
Feature activation+0.218
does
Token does
Feature activation+0.096
not
Token not
Feature activation+0.000
support
Token support
Feature activation+0.000
copying
Token copying
Feature activation+0.000
or
Token or
Feature activation+0.000
Iraq
Token Iraq
Feature activation+0.000
and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000
Imagine
TokenImagine
Feature activation+0.000
if
Token if
Feature activation+0.000
American
Token American
Feature activation+0.000
troops
Token troops
Feature activation+0.000
stationed
Token stationed
Feature activation+0.000
in
Token in
Feature activation+0.099
Iraq
Token Iraq
Feature activation+0.000
and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
a
Token a
Feature activation+0.000
unique
Token unique
Feature activation+0.000
pointer
Token pointer
Feature activation+0.000
implementation
Token implementation
Feature activation+0.000
that
Token that
Feature activation+0.218
does
Token does
Feature activation+0.096
not
Token not
Feature activation+0.000
support
Token support
Feature activation+0.000
copying
Token copying
Feature activation+0.000
or
Token or
Feature activation+0.000
copy
Token copy
Feature activation+0.000
res
Tokenres
Feature activation+0.000
idence
Tokenidence
Feature activation+0.000
card
Token card
Feature activation+0.000
Keep
Token Keep
Feature activation+0.000
the
Token the
Feature activation+0.002
following
Token following
Feature activation+0.089
items
Token items
Feature activation+0.000
in
Token in
Feature activation+0.092
a
Token a
Feature activation+0.116
place
Token place
Feature activation+0.058
where
Token where
Feature activation+0.084
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000
local
Token local
Feature activation+0.000
women
Token women
Feature activation+0.000
a
Token a
Feature activation+0.000
stationed
Token stationed
Feature activation+0.000
in
Token in
Feature activation+0.099
Iraq
Token Iraq
Feature activation+0.000
and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000
local
Token local
Feature activation+0.000
women
Token women
Feature activation+0.000
a
Token a
Feature activation+0.000
year
Token year
Feature activation+0.000
in
Token in
Feature activation+0.099
Iraq
Token Iraq
Feature activation+0.000
and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
gun
Token gun
Feature activation+0.000
rights
Token rights
Feature activation+0.000
activist
Token activist
Feature activation+0.000
Mind
Token Mind
Feature activation+0.000
y
Tokeny
Feature activation+0.000
Costa
Token Costa
Feature activation+0.008
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
grew
Token grew
Feature activation+0.000
up
Token up
Feature activation+0.000
in
Token in
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
res
Tokenres
Feature activation+0.000
idence
Tokenidence
Feature activation+0.000
card
Token card
Feature activation+0.000
Keep
Token Keep
Feature activation+0.000
the
Token the
Feature activation+0.002
following
Token following
Feature activation+0.089
items
Token items
Feature activation+0.000
in
Token in
Feature activation+0.092
a
Token a
Feature activation+0.116
place
Token place
Feature activation+0.058
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000

Top DFA by src position
MAX = 1.278

and
Token and
Feature activation+0.100
Top resid features:
Afghanistan
Token Afghanistan
Feature activation+0.084
Top resid features:
were
Token were
Feature activation+0.135
Top resid features:
formally
Token formally
Feature activation+0.250
Top resid features:
charged
Token charged
Feature activation+0.356
Top resid features:
with
Token with
Feature activation+0.510
Top resid features:
raping
Token raping
Feature activation+0.000
Top resid features:
3
Token 3
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
374
Token374
Feature activation+0.000
Top resid features:
local
Token local
Feature activation+0.000
Top resid features:
Afghanistan
Token Afghanistan
Feature activation+0.087
Top resid features:
were
Token were
Feature activation+0.109
Top resid features:
formally
Token formally
Feature activation+0.162
Top resid features:
charged
Token charged
Feature activation+0.202
Top resid features:
with
Token with
Feature activation+0.129
Top resid features:
raping
Token raping
Feature activation+0.434
Top resid features:
3
Token 3
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
374
Token374
Feature activation+0.000
Top resid features:
local
Token local
Feature activation+0.000
Top resid features:
women
Token women
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.122
Top resid features:
a
Token a
Feature activation+0.152
Top resid features:
unique
Token unique
Feature activation+0.070
Top resid features:
pointer
Token pointer
Feature activation+0.129
Top resid features:
implementation
Token implementation
Feature activation+0.213
Top resid features:
that
Token that
Feature activation+0.545
Top resid features:
does
Token does
Feature activation+0.000
Top resid features:
not
Token not
Feature activation+0.000
Top resid features:
support
Token support
Feature activation+0.000
Top resid features:
copying
Token copying
Feature activation+0.000
Top resid features:
or
Token or
Feature activation+0.000
Top resid features:
Iraq
Token Iraq
Feature activation+0.098
Top resid features:
and
Token and
Feature activation+0.114
Top resid features:
Afghanistan
Token Afghanistan
Feature activation+0.077
Top resid features:
were
Token were
Feature activation+0.134
Top resid features:
formally
Token formally
Feature activation+0.239
Top resid features:
charged
Token charged
Feature activation+0.559
Top resid features:
with
Token with
Feature activation+0.000
Top resid features:
raping
Token raping
Feature activation+0.000
Top resid features:
3
Token 3
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
374
Token374
Feature activation+0.000
Top resid features:
Imagine
TokenImagine
Feature activation+0.154
Top resid features:
if
Token if
Feature activation+0.112
Top resid features:
American
Token American
Feature activation+0.288
Top resid features:
troops
Token troops
Feature activation+0.250
Top resid features:
stationed
Token stationed
Feature activation+0.329
Top resid features:
in
Token in
Feature activation+0.685
Top resid features:
Iraq
Token Iraq
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
Afghanistan
Token Afghanistan
Feature activation+0.000
Top resid features:
were
Token were
Feature activation+0.000
Top resid features:
formally
Token formally
Feature activation+0.000
Top resid features:
is
Token is
Feature activation+0.077
Top resid features:
a
Token a
Feature activation+0.132
Top resid features:
unique
Token unique
Feature activation+0.052
Top resid features:
pointer
Token pointer
Feature activation+0.114
Top resid features:
implementation
Token implementation
Feature activation+0.155
Top resid features:
that
Token that
Feature activation+0.450
Top resid features:
does
Token does
Feature activation+0.133
Top resid features:
not
Token not
Feature activation+0.000
Top resid features:
support
Token support
Feature activation+0.000
Top resid features:
copying
Token copying
Feature activation+0.000
Top resid features:
or
Token or
Feature activation+0.000
Top resid features:
res
Tokenres
Feature activation+0.098
Top resid features:
idence
Tokenidence
Feature activation+0.083
Top resid features:
card
Token card
Feature activation+0.159
Top resid features:
Keep
Token Keep
Feature activation+0.173
Top resid features:
the
Token the
Feature activation+0.233
Top resid features:
following
Token following
Feature activation+0.647
Top resid features:
items
Token items
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
place
Token place
Feature activation+0.000
Top resid features:
where
Token where
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.438
Top resid features:
MS
TokenMS
Feature activation+0.014
Top resid features:
NBC
TokenNBC
Feature activation+0.043
Top resid features:
Ċ
TokenĊ
Feature activation+0.028
Top resid features:
Ċ
TokenĊ
Feature activation+0.022
Top resid features:
in
Tokenin
Feature activation+0.008
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.462
Top resid features:
MS
TokenMS
Feature activation+0.011
Top resid features:
NBC
TokenNBC
Feature activation+0.025
Top resid features:
Ċ
TokenĊ
Feature activation+0.020
Top resid features:
Ċ
TokenĊ
Feature activation+0.015
Top resid features:
in
Tokenin
Feature activation+0.011
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.441
Top resid features:
MS
TokenMS
Feature activation+0.010
Top resid features:
NBC
TokenNBC
Feature activation+0.022
Top resid features:
Ċ
TokenĊ
Feature activation+0.017
Top resid features:
Ċ
TokenĊ
Feature activation+0.014
Top resid features:
in
Tokenin
Feature activation+0.010
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.414
Top resid features:
MS
TokenMS
Feature activation+0.010
Top resid features:
NBC
TokenNBC
Feature activation+0.029
Top resid features:
Ċ
TokenĊ
Feature activation+0.019
Top resid features:
Ċ
TokenĊ
Feature activation+0.016
Top resid features:
in
Tokenin
Feature activation+0.007
Top resid features:
gun
Token gun
Feature activation+0.117
Top resid features:
rights
Token rights
Feature activation+0.166
Top resid features:
activist
Token activist
Feature activation+0.257
Top resid features:
Mind
Token Mind
Feature activation+0.065
Top resid features:
y
Tokeny
Feature activation+0.056
Top resid features:
Costa
Token Costa
Feature activation+0.489
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
who
Token who
Feature activation+0.000
Top resid features:
grew
Token grew
Feature activation+0.000
Top resid features:
up
Token up
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation-0.051
Top resid features:
res
Tokenres
Feature activation+0.126
Top resid features:
idence
Tokenidence
Feature activation+0.092
Top resid features:
card
Token card
Feature activation+0.180
Top resid features:
Keep
Token Keep
Feature activation+0.296
Top resid features:
the
Token the
Feature activation+0.620
Top resid features:
following
Token following
Feature activation+0.000
Top resid features:
items
Token items
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
place
Token place
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.889
Top resid features:
ile
Tokenile
Feature activation+0.075
Top resid features:
(
Token (
Feature activation-0.002
Top resid features:
Michael
TokenMichael
Feature activation-0.136
Top resid features:
C
Token C
Feature activation+0.046
Top resid features:
aine
Tokenaine
Feature activation+0.026
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.009
Top resid features:
ile
Tokenile
Feature activation+0.015
Top resid features:
(
Token (
Feature activation+0.015
Top resid features:
Michael
TokenMichael
Feature activation-0.196
Top resid features:
C
Token C
Feature activation+0.061
Top resid features:
aine
Tokenaine
Feature activation-0.001
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.868
Top resid features:
ile
Tokenile
Feature activation+0.051
Top resid features:
(
Token (
Feature activation+0.034
Top resid features:
Michael
TokenMichael
Feature activation-0.127
Top resid features:
C
Token C
Feature activation+0.070
Top resid features:
aine
Tokenaine
Feature activation+0.019
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.018
Top resid features:
ile
Tokenile
Feature activation+0.026
Top resid features:
(
Token (
Feature activation+0.042
Top resid features:
Michael
TokenMichael
Feature activation-0.252
Top resid features:
C
Token C
Feature activation+0.129
Top resid features:
aine
Tokenaine
Feature activation+0.064
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.278
Top resid features:
ile
Tokenile
Feature activation+0.011
Top resid features:
(
Token (
Feature activation+0.017
Top resid features:
Michael
TokenMichael
Feature activation-0.321
Top resid features:
C
Token C
Feature activation+0.181
Top resid features:
aine
Tokenaine
Feature activation+0.186
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.003
Top resid features:
ile
Tokenile
Feature activation+0.043
Top resid features:
(
Token (
Feature activation+0.013
Top resid features:
Michael
TokenMichael
Feature activation-0.204
Top resid features:
C
Token C
Feature activation+0.077
Top resid features:
aine
Tokenaine
Feature activation+0.022
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.991
Top resid features:
ile
Tokenile
Feature activation+0.010
Top resid features:
(
Token (
Feature activation+0.007
Top resid features:
Michael
TokenMichael
Feature activation-0.172
Top resid features:
C
Token C
Feature activation+0.061
Top resid features:
aine
Tokenaine
Feature activation+0.017
Top resid features:

Decoder Weights Distribution

Head 0: 0.06

Head 1: 0.09

Head 2: 0.07

Head 3: 0.05

Head 4: 0.06

Head 5: 0.06

Head 6: 0.06

Head 7: 0.11

Head 8: 0.17

Head 9: 0.12

Head 10: 0.09

Head 11: 0.06

Positive logits

Warranty1.88

Wallet1.87

awei1.81

Interested1.81

KDE1.77

warranties1.72

payer1.59

1.58

uga1.58

profits1.52

widgets1.52

Bay1.51

Wallet1.51

widget1.51

enthal1.50

carriers1.49

GW1.48

wallets1.46

Samsung1.46

xual1.46

Negative logits

thence-1.82

simulac-1.50

pty-1.41

æ-1.39

erer-1.35

nit-1.35

cffffcc-1.34

ac-1.30

dd-1.29

himself-1.28

opher-1.26

indist-1.26

Judd-1.25

esi-1.25

icy-1.25

slept-1.24

sold-1.23

dw-1.22

YS-1.22

thunder-1.22

INTERVAL 0.392 - 0.436
CONTAINS 0.000%

and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000
local
Token local
Feature activation+0.000

INTERVAL 0.348 - 0.392
CONTAINS 0.000%

Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000
local
Token local
Feature activation+0.000
women
Token women
Feature activation+0.000

INTERVAL 0.305 - 0.348
CONTAINS 0.000%

INTERVAL 0.261 - 0.305
CONTAINS 0.000%

INTERVAL 0.218 - 0.261
CONTAINS 0.000%

is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
unique
Token unique
Feature activation+0.000
pointer
Token pointer
Feature activation+0.000
implementation
Token implementation
Feature activation+0.000
that
Token that
Feature activation+0.218
does
Token does
Feature activation+0.096
not
Token not
Feature activation+0.000
support
Token support
Feature activation+0.000
copying
Token copying
Feature activation+0.000
or
Token or
Feature activation+0.000

INTERVAL 0.174 - 0.218
CONTAINS 0.000%

INTERVAL 0.131 - 0.174
CONTAINS 0.000%

Iraq
Token Iraq
Feature activation+0.000
and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
charged
Token charged
Feature activation+0.172
with
Token with
Feature activation+0.436
raping
Token raping
Feature activation+0.359
3
Token 3
Feature activation+0.038
,
Token,
Feature activation+0.023
374
Token374
Feature activation+0.000

INTERVAL 0.087 - 0.131
CONTAINS 0.001%

res
Tokenres
Feature activation+0.000
idence
Tokenidence
Feature activation+0.000
card
Token card
Feature activation+0.000
Keep
Token Keep
Feature activation+0.000
the
Token the
Feature activation+0.002
following
Token following
Feature activation+0.089
items
Token items
Feature activation+0.000
in
Token in
Feature activation+0.092
a
Token a
Feature activation+0.116
place
Token place
Feature activation+0.058
where
Token where
Feature activation+0.084
Imagine
TokenImagine
Feature activation+0.000
if
Token if
Feature activation+0.000
American
Token American
Feature activation+0.000
troops
Token troops
Feature activation+0.000
stationed
Token stationed
Feature activation+0.000
in
Token in
Feature activation+0.099
Iraq
Token Iraq
Feature activation+0.000
and
Token and
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
were
Token were
Feature activation+0.032
formally
Token formally
Feature activation+0.008
a
Token a
Feature activation+0.000
unique
Token unique
Feature activation+0.000
pointer
Token pointer
Feature activation+0.000
implementation
Token implementation
Feature activation+0.000
that
Token that
Feature activation+0.218
does
Token does
Feature activation+0.096
not
Token not
Feature activation+0.000
support
Token support
Feature activation+0.000
copying
Token copying
Feature activation+0.000
or
Token or
Feature activation+0.000
copy
Token copy
Feature activation+0.000

INTERVAL 0.044 - 0.087
CONTAINS 0.000%

INTERVAL 0.000 - 0.044
CONTAINS 99.999%

said
Token said
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
Even
TokenEven
Feature activation+0.000
if
Token if
Feature activation+0.000
we
Token we
Feature activation+0.000
ceased
Token ceased
Feature activation+0.000
all
Token all
Feature activation+0.000
hold
Token hold
Feature activation+0.000
while
Token while
Feature activation+0.000
Sheen
Token Sheen
Feature activation+0.000
tried
Token tried
Feature activation+0.000
rehab
Token rehab
Feature activation+0.000
,
Token,
Feature activation+0.000
reportedly
Token reportedly
Feature activation+0.000
at
Token at
Feature activation+0.000
home
Token home
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
of
Token of
Feature activation+0.000
action
Token action
Feature activation+0.000
and
Token and
Feature activation+0.000
response
Token response
Feature activation+0.000
patterns
Token patterns
Feature activation+0.000
than
Token than
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ĺ
Tokenĺ
Feature activation+0.000
regular
Tokenregular
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
reach
Token reach
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Equ
TokenEqu
Feature activation+0.000
ally
Tokenally
Feature activation+0.000
surprising
Token surprising
Feature activation+0.000
is
Token is
Feature activation+0.000
Trump
Token Trump
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
0
Token0
Feature activation+0.000
39
Token39
Feature activation+0.000
Estimated
Token Estimated
Feature activation+0.000
Muslim
Token Muslim
Feature activation+0.000
Population
Token Population
Feature activation+0.000
400
Token 400
Feature activation+0.000
,
Token,
Feature activation+0.000
000
Token000
Feature activation+0.000
+
Token+
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 5 in H1.8: (feature 13781

TOP ACTIVATIONS
MAX = 0.115

being
Token being
Feature activation+0.000
a
Token a
Feature activation+0.000
blessing
Token blessing
Feature activation+0.000
to
Token to
Feature activation+0.000
their
Token their
Feature activation+0.000
children
Token children
Feature activation+0.115
and
Token and
Feature activation+0.000
we
Token we
Feature activation+0.000
forget
Token forget
Feature activation+0.000
the
Token the
Feature activation+0.000
the
Token the
Feature activation+0.000
but
Token but
Feature activation+0.000
he
Token he
Feature activation+0.000
always
Token always
Feature activation+0.000
said
Token said
Feature activation+0.000
he
Token he
Feature activation+0.000
liked
Token liked
Feature activation+0.085
that
Token that
Feature activation+0.000
about
Token about
Feature activation+0.000
me
Token me
Feature activation+0.000
,
Token,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
she
Token she
Feature activation+0.000
loves
Token loves
Feature activation+0.000
her
Token her
Feature activation+0.000
sister
Token sister
Feature activation+0.000
and
Token and
Feature activation+0.000
loves
Token loves
Feature activation+0.046
watching
Token watching
Feature activation+0.000
people
Token people
Feature activation+0.000
betray
Token betray
Feature activation+0.000
and
Token and
Feature activation+0.000
kill
Token kill
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000

Top DFA by src position
MAX = 0.932

my
Token my
Feature activation+0.046
Top resid features:
relationship
Token relationship
Feature activation+0.105
Top resid features:
with
Token with
Feature activation+0.030
Top resid features:
him
Token him
Feature activation+0.072
Top resid features:
.
Token.
Feature activation+0.069
Top resid features:
Parents
Token Parents
Feature activation+0.610
Top resid features:
are
Token are
Feature activation+0.065
Top resid features:
really
Token really
Feature activation+0.087
Top resid features:
obsessed
Token obsessed
Feature activation+0.122
Top resid features:
with
Token with
Feature activation+0.052
Top resid features:
being
Token being
Feature activation+0.087
Top resid features:
but
Token but
Feature activation+0.099
Top resid features:
he
Token he
Feature activation+0.047
Top resid features:
always
Token always
Feature activation+0.120
Top resid features:
said
Token said
Feature activation+0.100
Top resid features:
he
Token he
Feature activation+0.021
Top resid features:
liked
Token liked
Feature activation+0.932
Top resid features:
that
Token that
Feature activation+0.000
Top resid features:
about
Token about
Feature activation+0.000
Top resid features:
me
Token me
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Top resid features:
on
Token on
Feature activation+0.031
Top resid features:
about
Token about
Feature activation+0.050
Top resid features:
how
Token how
Feature activation+0.078
Top resid features:
much
Token much
Feature activation+0.074
Top resid features:
she
Token she
Feature activation+0.140
Top resid features:
loves
Token loves
Feature activation+0.650
Top resid features:
her
Token her
Feature activation+0.078
Top resid features:
sister
Token sister
Feature activation+0.305
Top resid features:
and
Token and
Feature activation+0.113
Top resid features:
loves
Token loves
Feature activation+0.379
Top resid features:
watching
Token watching
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.409
Top resid features:
ile
Tokenile
Feature activation+0.022
Top resid features:
(
Token (
Feature activation-0.028
Top resid features:
Michael
TokenMichael
Feature activation+0.036
Top resid features:
C
Token C
Feature activation-0.011
Top resid features:
aine
Tokenaine
Feature activation-0.042
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.623
Top resid features:
ile
Tokenile
Feature activation-0.091
Top resid features:
(
Token (
Feature activation-0.282
Top resid features:
Michael
TokenMichael
Feature activation+0.130
Top resid features:
C
Token C
Feature activation-0.056
Top resid features:
aine
Tokenaine
Feature activation-0.476
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.400
Top resid features:
ile
Tokenile
Feature activation+0.014
Top resid features:
(
Token (
Feature activation-0.035
Top resid features:
Michael
TokenMichael
Feature activation+0.031
Top resid features:
C
Token C
Feature activation-0.006
Top resid features:
aine
Tokenaine
Feature activation-0.047
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.433
Top resid features:
ile
Tokenile
Feature activation+0.020
Top resid features:
(
Token (
Feature activation-0.042
Top resid features:
Michael
TokenMichael
Feature activation+0.034
Top resid features:
C
Token C
Feature activation-0.014
Top resid features:
aine
Tokenaine
Feature activation-0.052
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.410
Top resid features:
ile
Tokenile
Feature activation+0.030
Top resid features:
(
Token (
Feature activation-0.037
Top resid features:
Michael
TokenMichael
Feature activation+0.033
Top resid features:
C
Token C
Feature activation-0.009
Top resid features:
aine
Tokenaine
Feature activation-0.064
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.424
Top resid features:
ile
Tokenile
Feature activation+0.021
Top resid features:
(
Token (
Feature activation-0.059
Top resid features:
Michael
TokenMichael
Feature activation+0.044
Top resid features:
C
Token C
Feature activation-0.025
Top resid features:
aine
Tokenaine
Feature activation-0.069
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.415
Top resid features:
ile
Tokenile
Feature activation-0.005
Top resid features:
(
Token (
Feature activation-0.064
Top resid features:
Michael
TokenMichael
Feature activation+0.050
Top resid features:
C
Token C
Feature activation-0.023
Top resid features:
aine
Tokenaine
Feature activation-0.073
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.384
Top resid features:
ile
Tokenile
Feature activation+0.023
Top resid features:
(
Token (
Feature activation-0.046
Top resid features:
Michael
TokenMichael
Feature activation+0.053
Top resid features:
C
Token C
Feature activation-0.028
Top resid features:
aine
Tokenaine
Feature activation-0.060
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.498
Top resid features:
ile
Tokenile
Feature activation+0.016
Top resid features:
(
Token (
Feature activation-0.126
Top resid features:
Michael
TokenMichael
Feature activation+0.098
Top resid features:
C
Token C
Feature activation-0.035
Top resid features:
aine
Tokenaine
Feature activation-0.106
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.375
Top resid features:
ile
Tokenile
Feature activation+0.032
Top resid features:
(
Token (
Feature activation-0.040
Top resid features:
Michael
TokenMichael
Feature activation+0.043
Top resid features:
C
Token C
Feature activation-0.016
Top resid features:
aine
Tokenaine
Feature activation-0.057
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.537
Top resid features:
ile
Tokenile
Feature activation-0.052
Top resid features:
(
Token (
Feature activation-0.163
Top resid features:
Michael
TokenMichael
Feature activation+0.108
Top resid features:
C
Token C
Feature activation-0.046
Top resid features:
aine
Tokenaine
Feature activation-0.163
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.487
Top resid features:
ile
Tokenile
Feature activation-0.094
Top resid features:
(
Token (
Feature activation-0.294
Top resid features:
Michael
TokenMichael
Feature activation+0.127
Top resid features:
C
Token C
Feature activation-0.058
Top resid features:
aine
Tokenaine
Feature activation-0.193
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.528
Top resid features:
ile
Tokenile
Feature activation-0.084
Top resid features:
(
Token (
Feature activation-0.213
Top resid features:
Michael
TokenMichael
Feature activation+0.130
Top resid features:
C
Token C
Feature activation-0.049
Top resid features:
aine
Tokenaine
Feature activation-0.197
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.486
Top resid features:
ile
Tokenile
Feature activation+0.004
Top resid features:
(
Token (
Feature activation-0.076
Top resid features:
Michael
TokenMichael
Feature activation+0.069
Top resid features:
C
Token C
Feature activation-0.026
Top resid features:
aine
Tokenaine
Feature activation-0.087
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.454
Top resid features:
ile
Tokenile
Feature activation+0.023
Top resid features:
(
Token (
Feature activation-0.113
Top resid features:
Michael
TokenMichael
Feature activation+0.085
Top resid features:
C
Token C
Feature activation-0.031
Top resid features:
aine
Tokenaine
Feature activation-0.094
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.450
Top resid features:
ile
Tokenile
Feature activation-0.006
Top resid features:
(
Token (
Feature activation-0.127
Top resid features:
Michael
TokenMichael
Feature activation+0.103
Top resid features:
C
Token C
Feature activation-0.018
Top resid features:
aine
Tokenaine
Feature activation-0.116
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.543
Top resid features:
ile
Tokenile
Feature activation-0.035
Top resid features:
(
Token (
Feature activation-0.134
Top resid features:
Michael
TokenMichael
Feature activation+0.093
Top resid features:
C
Token C
Feature activation-0.026
Top resid features:
aine
Tokenaine
Feature activation-0.166
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.07

Head 2: 0.07

Head 3: 0.03

Head 4: 0.08

Head 5: 0.07

Head 6: 0.07

Head 7: 0.12

Head 8: 0.15

Head 9: 0.13

Head 10: 0.07

Head 11: 0.08

Positive logits

admins1.43

resil1.40

tools1.40

inators1.36

ministic1.33

igans1.31

agus1.30

acci1.30

Splash1.24

DoS1.24

Awareness1.22

Carnage1.21

plugins1.21

ifax1.18

agna1.18

Browser1.18

rame1.18

Community1.18

imal1.17

gardening1.17

Negative logits

HCR-1.68

elector-1.54

kHz-1.41

framing-1.29

ゴン-1.25

ーテ-1.24

oys-1.23

Byr-1.23

annex-1.22

displ-1.20

-1.19

§§-1.18

ridor-1.18

Landing-1.16

elev-1.15

emption-1.15

tion-1.14

paraph-1.14

departing-1.14

deem-1.13

INTERVAL 0.103 - 0.115
CONTAINS 0.000%

being
Token being
Feature activation+0.000
a
Token a
Feature activation+0.000
blessing
Token blessing
Feature activation+0.000
to
Token to
Feature activation+0.000
their
Token their
Feature activation+0.000
children
Token children
Feature activation+0.115
and
Token and
Feature activation+0.000
we
Token we
Feature activation+0.000
forget
Token forget
Feature activation+0.000
the
Token the
Feature activation+0.000
the
Token the
Feature activation+0.000

INTERVAL 0.092 - 0.103
CONTAINS 0.000%

INTERVAL 0.080 - 0.092
CONTAINS 0.000%

but
Token but
Feature activation+0.000
he
Token he
Feature activation+0.000
always
Token always
Feature activation+0.000
said
Token said
Feature activation+0.000
he
Token he
Feature activation+0.000
liked
Token liked
Feature activation+0.085
that
Token that
Feature activation+0.000
about
Token about
Feature activation+0.000
me
Token me
Feature activation+0.000
,
Token,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000

INTERVAL 0.069 - 0.080
CONTAINS 0.000%

INTERVAL 0.057 - 0.069
CONTAINS 0.000%

INTERVAL 0.046 - 0.057
CONTAINS 0.000%

she
Token she
Feature activation+0.000
loves
Token loves
Feature activation+0.000
her
Token her
Feature activation+0.000
sister
Token sister
Feature activation+0.000
and
Token and
Feature activation+0.000
loves
Token loves
Feature activation+0.046
watching
Token watching
Feature activation+0.000
people
Token people
Feature activation+0.000
betray
Token betray
Feature activation+0.000
and
Token and
Feature activation+0.000
kill
Token kill
Feature activation+0.000

INTERVAL 0.034 - 0.046
CONTAINS 0.000%

INTERVAL 0.023 - 0.034
CONTAINS 0.000%

INTERVAL 0.011 - 0.023
CONTAINS 0.000%

INTERVAL 0.000 - 0.011
CONTAINS 100.000%

half
Token half
Feature activation+0.000
of
Token of
Feature activation+0.000
Trump
Token Trump
Feature activation+0.000
's
Token's
Feature activation+0.000
supporters
Token supporters
Feature activation+0.000
into
Token into
Feature activation+0.000
what
Token what
Feature activation+0.000
I
Token I
Feature activation+0.000
call
Token call
Feature activation+0.000
the
Token the
Feature activation+0.000
basket
Token basket
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
Northern
Token Northern
Feature activation+0.000
Kentucky
Token Kentucky
Feature activation+0.000
9
Token 9
Feature activation+0.000
/
Token/
Feature activation+0.000
11
Token11
Feature activation+0.000
Memorial
Token Memorial
Feature activation+0.000
.
Token.
Feature activation+0.000
Berry
Token Berry
Feature activation+0.000
's
Token's
Feature activation+0.000
people
Token people
Feature activation+0.000
in
Token in
Feature activation+0.000
America
Token America
Feature activation+0.000
.
Token.
Feature activation+0.000
How
Token How
Feature activation+0.000
ie
Tokenie
Feature activation+0.000
Carr
Token Carr
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
Boston
Token Boston
Feature activation+0.000
area
Token area
Feature activation+0.000
been
Token been
Feature activation+0.000
negative
Token negative
Feature activation+0.000
because
Token because
Feature activation+0.000
journalists
Token journalists
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
agree
Token agree
Feature activation+0.000
with
Token with
Feature activation+0.000
Trump
Token Trump
Feature activation+0.000
's
Token's
Feature activation+0.000
policies
Token policies
Feature activation+0.000
s
Tokens
Feature activation+0.000
land
Token land
Feature activation+0.000
mass
Tokenmass
Feature activation+0.000
âĢĵ
Token âĢĵ
Feature activation+0.000
it
Token it
Feature activation+0.000
is
Token is
Feature activation+0.000
possible
Token possible
Feature activation+0.000
that
Token that
Feature activation+0.000
the
Token the
Feature activation+0.000
most
Token most
Feature activation+0.000
recent
Token recent
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 6 in H1.8: (feature 6402

TOP ACTIVATIONS
MAX = 1.544

early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
.
Token.
Feature activation+0.180
Ċ
TokenĊ
Feature activation+0.576
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
Times
Token Times
Feature activation+0.000
newsletters
Token newsletters
Feature activation+0.000
.
Token.
Feature activation+0.180
Ċ
TokenĊ
Feature activation+0.576
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
newsletters
Token newsletters
Feature activation+0.000
.
Token.
Feature activation+0.180
Ċ
TokenĊ
Feature activation+0.576
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
Ċ
TokenĊ
Feature activation+0.576
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
a
Token a
Feature activation+1.260
moratorium
Token moratorium
Feature activation+1.267
on
Token on
Feature activation+1.140
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
a
Token a
Feature activation+1.260
nder
Tokennder
Feature activation+1.070
fur
Tokenfur
Feature activation+1.140
th
Tokenth
Feature activation+1.150
,
Token,
Feature activation+1.186
who
Token who
Feature activation+1.160
is
Token is
Feature activation+1.268
now
Token now
Feature activation+1.235
a
Token a
Feature activation+1.075
professor
Token professor
Feature activation+0.926
of
Token of
Feature activation+0.999
international
Token international
Feature activation+1.002
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
âĢ
TokenâĢ
Feature activation+1.026
Ŀ
TokenĿ
Feature activation+0.958
said
Token said
Feature activation+1.079
Mr
Token Mr
Feature activation+1.046
.
Token.
Feature activation+1.077
I
Token I
Feature activation+1.255
nder
Tokennder
Feature activation+1.070
fur
Tokenfur
Feature activation+1.140
th
Tokenth
Feature activation+1.150
,
Token,
Feature activation+1.186
who
Token who
Feature activation+1.160
fur
Tokenfur
Feature activation+1.140
th
Tokenth
Feature activation+1.150
,
Token,
Feature activation+1.186
who
Token who
Feature activation+1.160
is
Token is
Feature activation+1.268
now
Token now
Feature activation+1.235
a
Token a
Feature activation+1.075
professor
Token professor
Feature activation+0.926
of
Token of
Feature activation+0.999
international
Token international
Feature activation+1.002
affairs
Token affairs
Feature activation+0.901
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
a
Token a
Feature activation+1.260
moratorium
Token moratorium
Feature activation+1.267
on
Token on
Feature activation+1.140
federal
Token federal
Feature activation+1.033
regulations
Token regulations
Feature activation+1.192
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
Ċ
TokenĊ
Feature activation+0.295
âĢ
TokenâĢ
Feature activation+1.074
ľ
Tokenľ
Feature activation+0.766
The
TokenThe
Feature activation+1.009
situation
Token situation
Feature activation+1.062
has
Token has
Feature activation+1.201
changed
Token changed
Feature activation+1.024
significantly
Token significantly
Feature activation+0.941
in
Token in
Feature activation+0.895
recent
Token recent
Feature activation+1.112
years
Token years
Feature activation+0.861
.
Token.
Feature activation+1.077
I
Token I
Feature activation+1.255
nder
Tokennder
Feature activation+1.070
fur
Tokenfur
Feature activation+1.140
th
Tokenth
Feature activation+1.150
,
Token,
Feature activation+1.186
who
Token who
Feature activation+1.160
is
Token is
Feature activation+1.268
now
Token now
Feature activation+1.235
a
Token a
Feature activation+1.075
professor
Token professor
Feature activation+0.926
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
a
Token a
Feature activation+1.260
moratorium
Token moratorium
Feature activation+1.267

Top DFA by src position
MAX = 0.398

early
Token early
Feature activation+0.069
Top resid features:
1995
Token 1995
Feature activation+0.102
Top resid features:
,
Token,
Feature activation+0.224
Top resid features:
after
Token after
Feature activation+0.119
Top resid features:
Republicans
Token Republicans
Feature activation+0.143
Top resid features:
had
Token had
Feature activation+0.261
Top resid features:
taken
Token taken
Feature activation+0.000
Top resid features:
control
Token control
Feature activation+0.000
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
Congress
Token Congress
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.135
Top resid features:
Ċ
TokenĊ
Feature activation+0.241
Top resid features:
Ċ
TokenĊ
Feature activation+0.255
Top resid features:
In
TokenIn
Feature activation+0.109
Top resid features:
early
Token early
Feature activation+0.157
Top resid features:
1995
Token 1995
Feature activation+0.261
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
after
Token after
Feature activation+0.000
Top resid features:
Republicans
Token Republicans
Feature activation+0.000
Top resid features:
had
Token had
Feature activation+0.000
Top resid features:
taken
Token taken
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.167
Top resid features:
Ċ
TokenĊ
Feature activation+0.165
Top resid features:
In
TokenIn
Feature activation+0.061
Top resid features:
early
Token early
Feature activation+0.113
Top resid features:
1995
Token 1995
Feature activation+0.152
Top resid features:
,
Token,
Feature activation+0.360
Top resid features:
after
Token after
Feature activation+0.166
Top resid features:
Republicans
Token Republicans
Feature activation+0.000
Top resid features:
had
Token had
Feature activation+0.000
Top resid features:
taken
Token taken
Feature activation+0.000
Top resid features:
control
Token control
Feature activation+0.000
Top resid features:
1995
Token 1995
Feature activation+0.082
Top resid features:
,
Token,
Feature activation+0.209
Top resid features:
after
Token after
Feature activation+0.126
Top resid features:
Republicans
Token Republicans
Feature activation+0.097
Top resid features:
had
Token had
Feature activation+0.011
Top resid features:
taken
Token taken
Feature activation+0.303
Top resid features:
control
Token control
Feature activation+0.000
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
Congress
Token Congress
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
Mr
Token Mr
Feature activation+0.000
Top resid features:
York
Token York
Feature activation+0.036
Top resid features:
Times
Token Times
Feature activation+0.032
Top resid features:
newsletters
Token newsletters
Feature activation+0.137
Top resid features:
.
Token.
Feature activation+0.132
Top resid features:
Ċ
TokenĊ
Feature activation+0.340
Top resid features:
Ċ
TokenĊ
Feature activation+0.398
Top resid features:
In
TokenIn
Feature activation+0.186
Top resid features:
early
Token early
Feature activation+0.000
Top resid features:
1995
Token 1995
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
after
Token after
Feature activation+0.000
Top resid features:
newsletters
Token newsletters
Feature activation+0.136
Top resid features:
.
Token.
Feature activation+0.135
Top resid features:
Ċ
TokenĊ
Feature activation+0.210
Top resid features:
Ċ
TokenĊ
Feature activation+0.228
Top resid features:
In
TokenIn
Feature activation+0.148
Top resid features:
early
Token early
Feature activation+0.260
Top resid features:
1995
Token 1995
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
after
Token after
Feature activation+0.000
Top resid features:
Republicans
Token Republicans
Feature activation+0.000
Top resid features:
had
Token had
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.143
Top resid features:
Ċ
TokenĊ
Feature activation+0.142
Top resid features:
In
TokenIn
Feature activation+0.043
Top resid features:
early
Token early
Feature activation+0.091
Top resid features:
1995
Token 1995
Feature activation+0.095
Top resid features:
,
Token,
Feature activation+0.289
Top resid features:
after
Token after
Feature activation+0.125
Top resid features:
Republicans
Token Republicans
Feature activation+0.188
Top resid features:
had
Token had
Feature activation+0.000
Top resid features:
taken
Token taken
Feature activation+0.000
Top resid features:
control
Token control
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.192
Top resid features:
Ċ
TokenĊ
Feature activation+0.200
Top resid features:
In
TokenIn
Feature activation+0.070
Top resid features:
early
Token early
Feature activation+0.153
Top resid features:
1995
Token 1995
Feature activation+0.155
Top resid features:
,
Token,
Feature activation+0.295
Top resid features:
after
Token after
Feature activation+0.000
Top resid features:
Republicans
Token Republicans
Feature activation+0.000
Top resid features:
had
Token had
Feature activation+0.000
Top resid features:
taken
Token taken
Feature activation+0.000
Top resid features:
control
Token control
Feature activation+0.000
Top resid features:
had
Token had
Feature activation+0.068
Top resid features:
taken
Token taken
Feature activation+0.074
Top resid features:
control
Token control
Feature activation+0.068
Top resid features:
of
Token of
Feature activation+0.039
Top resid features:
Congress
Token Congress
Feature activation+0.120
Top resid features:
,
Token,
Feature activation+0.308
Top resid features:
Mr
Token Mr
Feature activation+0.119
Top resid features:
.
Token.
Feature activation+0.142
Top resid features:
McCain
Token McCain
Feature activation+0.000
Top resid features:
promoted
Token promoted
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.165
Top resid features:
after
Token after
Feature activation+0.096
Top resid features:
Republicans
Token Republicans
Feature activation+0.074
Top resid features:
had
Token had
Feature activation+0.140
Top resid features:
taken
Token taken
Feature activation+0.070
Top resid features:
control
Token control
Feature activation+0.197
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
Congress
Token Congress
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
Mr
Token Mr
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
had
Token had
Feature activation+0.111
Top resid features:
taken
Token taken
Feature activation+0.093
Top resid features:
control
Token control
Feature activation+0.090
Top resid features:
of
Token of
Feature activation+0.071
Top resid features:
Congress
Token Congress
Feature activation+0.127
Top resid features:
,
Token,
Feature activation+0.279
Top resid features:
Mr
Token Mr
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
McCain
Token McCain
Feature activation+0.000
Top resid features:
promoted
Token promoted
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
I
Token I
Feature activation+0.045
Top resid features:
nder
Tokennder
Feature activation+0.052
Top resid features:
fur
Tokenfur
Feature activation+0.099
Top resid features:
th
Tokenth
Feature activation+0.047
Top resid features:
,
Token,
Feature activation+0.194
Top resid features:
who
Token who
Feature activation+0.267
Top resid features:
is
Token is
Feature activation+0.168
Top resid features:
now
Token now
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
professor
Token professor
Feature activation+0.000
Top resid features:
of
Token of
Feature activation+0.000
Top resid features:
Ċ
TokenĊ
Feature activation+0.107
Top resid features:
Ċ
TokenĊ
Feature activation+0.101
Top resid features:
In
TokenIn
Feature activation+0.042
Top resid features:
early
Token early
Feature activation+0.053
Top resid features:
1995
Token 1995
Feature activation+0.060
Top resid features:
,
Token,
Feature activation+0.158
Top resid features:
after
Token after
Feature activation+0.062
Top resid features:
Republicans
Token Republicans
Feature activation+0.111
Top resid features:
had
Token had
Feature activation+0.118
Top resid features:
taken
Token taken
Feature activation+0.097
Top resid features:
control
Token control
Feature activation+0.153
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.252
Top resid features:
,
Token ,
Feature activation-0.002
Top resid features:
updates
Token updates
Feature activation+0.018
Top resid features:
and
Token and
Feature activation-0.000
Top resid features:
promotions
Token promotions
Feature activation+0.021
Top resid features:
from
Token from
Feature activation-0.007
Top resid features:
âĢ
TokenâĢ
Feature activation+0.030
Top resid features:
ľ
Tokenľ
Feature activation+0.002
Top resid features:
The
TokenThe
Feature activation+0.023
Top resid features:
situation
Token situation
Feature activation+0.031
Top resid features:
has
Token has
Feature activation+0.020
Top resid features:
changed
Token changed
Feature activation+0.265
Top resid features:
significantly
Token significantly
Feature activation-0.011
Top resid features:
in
Token in
Feature activation+0.008
Top resid features:
recent
Token recent
Feature activation+0.046
Top resid features:
years
Token years
Feature activation+0.031
Top resid features:
,
Token,
Feature activation+0.067
Top resid features:
had
Token had
Feature activation+0.044
Top resid features:
taken
Token taken
Feature activation+0.060
Top resid features:
control
Token control
Feature activation+0.035
Top resid features:
of
Token of
Feature activation+0.036
Top resid features:
Congress
Token Congress
Feature activation+0.024
Top resid features:
,
Token,
Feature activation+0.201
Top resid features:
Mr
Token Mr
Feature activation+0.078
Top resid features:
.
Token.
Feature activation+0.106
Top resid features:
McCain
Token McCain
Feature activation+0.150
Top resid features:
promoted
Token promoted
Feature activation+0.163
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:
Republicans
Token Republicans
Feature activation+0.043
Top resid features:
had
Token had
Feature activation+0.077
Top resid features:
taken
Token taken
Feature activation+0.093
Top resid features:
control
Token control
Feature activation+0.113
Top resid features:
of
Token of
Feature activation+0.143
Top resid features:
Congress
Token Congress
Feature activation+0.171
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
Mr
Token Mr
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
McCain
Token McCain
Feature activation+0.000
Top resid features:
promoted
Token promoted
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.199
Top resid features:
,
Token ,
Feature activation+0.006
Top resid features:
updates
Token updates
Feature activation+0.030
Top resid features:
and
Token and
Feature activation+0.010
Top resid features:
promotions
Token promotions
Feature activation+0.038
Top resid features:
from
Token from
Feature activation-0.006
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.214
Top resid features:
,
Token ,
Feature activation-0.005
Top resid features:
updates
Token updates
Feature activation+0.009
Top resid features:
and
Token and
Feature activation-0.000
Top resid features:
promotions
Token promotions
Feature activation+0.022
Top resid features:
from
Token from
Feature activation-0.010
Top resid features:
had
Token had
Feature activation+0.061
Top resid features:
taken
Token taken
Feature activation+0.069
Top resid features:
control
Token control
Feature activation+0.056
Top resid features:
of
Token of
Feature activation+0.047
Top resid features:
Congress
Token Congress
Feature activation+0.074
Top resid features:
,
Token,
Feature activation+0.293
Top resid features:
Mr
Token Mr
Feature activation+0.164
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
McCain
Token McCain
Feature activation+0.000
Top resid features:
promoted
Token promoted
Feature activation+0.000
Top resid features:
a
Token a
Feature activation+0.000
Top resid features:

Decoder Weights Distribution

Head 0: 0.03

Head 1: 0.07

Head 2: 0.07

Head 3: 0.04

Head 4: 0.07

Head 5: 0.07

Head 6: 0.09

Head 7: 0.15

Head 8: 0.15

Head 9: 0.12

Head 10: 0.09

Head 11: 0.06

Positive logits

Mubarak1.38

agna1.25

toget1.22

reset1.20

bread1.18

roads1.15

ドラゴン1.14

Clintons1.14

Bere1.14

flight1.13

ansas1.12

ahime1.11

Slovenia1.10

kowski1.09

rots1.08

enezuel1.07

uay1.06

arat1.06

Libya1.05

imb1.05

Negative logits

ONSORED-1.25

rave-1.17

dx-1.17

nm-1.16

Idol-1.14

Madness-1.13

Hip-1.11

DIT-1.11

liner-1.10

pic-1.10

targeted-1.09

ancies-1.07

pseudonym-1.07

ewitness-1.05

Kin-1.04

KEN-1.02

ops-1.00

fav-1.00

umblr-0.99

lil-0.98

INTERVAL 1.390 - 1.544
CONTAINS 0.001%

In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
.
Token.
Feature activation+0.180
Ċ
TokenĊ
Feature activation+0.576
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
newsletters
Token newsletters
Feature activation+0.000
.
Token.
Feature activation+0.180
Ċ
TokenĊ
Feature activation+0.576
Ċ
TokenĊ
Feature activation+0.800
In
TokenIn
Feature activation+1.416
early
Token early
Feature activation+1.397
1995
Token 1995
Feature activation+1.506
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544

INTERVAL 1.235 - 1.390
CONTAINS 0.001%

âĢ
TokenâĢ
Feature activation+1.026
Ŀ
TokenĿ
Feature activation+0.958
said
Token said
Feature activation+1.079
Mr
Token Mr
Feature activation+1.046
.
Token.
Feature activation+1.077
I
Token I
Feature activation+1.255
nder
Tokennder
Feature activation+1.070
fur
Tokenfur
Feature activation+1.140
th
Tokenth
Feature activation+1.150
,
Token,
Feature activation+1.186
who
Token who
Feature activation+1.160
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
a
Token a
Feature activation+1.260
,
Token,
Feature activation+1.355
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
fur
Tokenfur
Feature activation+1.140
th
Tokenth
Feature activation+1.150
,
Token,
Feature activation+1.186
who
Token who
Feature activation+1.160
is
Token is
Feature activation+1.268
now
Token now
Feature activation+1.235
a
Token a
Feature activation+1.075
professor
Token professor
Feature activation+0.926
of
Token of
Feature activation+0.999
international
Token international
Feature activation+1.002
affairs
Token affairs
Feature activation+0.901
after
Token after
Feature activation+1.455
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106

INTERVAL 1.081 - 1.235
CONTAINS 0.002%

situation
Token situation
Feature activation+1.062
has
Token has
Feature activation+1.201
changed
Token changed
Feature activation+1.024
significantly
Token significantly
Feature activation+0.941
in
Token in
Feature activation+0.895
recent
Token recent
Feature activation+1.112
years
Token years
Feature activation+0.861
,
Token,
Feature activation+0.887
âĢ
TokenâĢ
Feature activation+1.026
Ŀ
TokenĿ
Feature activation+0.958
said
Token said
Feature activation+1.079
Republicans
Token Republicans
Feature activation+1.392
had
Token had
Feature activation+1.544
taken
Token taken
Feature activation+1.418
control
Token control
Feature activation+1.325
of
Token of
Feature activation+1.262
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
pest
Token pest
Feature activation+0.924
detection
Token detection
Feature activation+0.908
software
Token software
Feature activation+1.031
when
Token when
Feature activation+0.976
they
Token they
Feature activation+1.155
did
Token did
Feature activation+1.092
the
Token the
Feature activation+0.956
Worcester
Token Worcester
Feature activation+0.824
inventory
Token inventory
Feature activation+0.917
,
Token,
Feature activation+1.025
âĢ
TokenâĢ
Feature activation+1.087
If
TokenIf
Feature activation+0.932
they
Token they
Feature activation+0.995
âĢ
TokenâĢ
Feature activation+0.963
Ļ
TokenĻ
Feature activation+0.717
d
Tokend
Feature activation+0.920
had
Token had
Feature activation+1.144
that
Token that
Feature activation+0.882
pest
Token pest
Feature activation+0.924
detection
Token detection
Feature activation+0.908
software
Token software
Feature activation+1.031
when
Token when
Feature activation+0.976
Congress
Token Congress
Feature activation+1.218
,
Token,
Feature activation+1.300
Mr
Token Mr
Feature activation+1.166
.
Token.
Feature activation+1.345
McCain
Token McCain
Feature activation+1.106
promoted
Token promoted
Feature activation+1.232
a
Token a
Feature activation+1.260
moratorium
Token moratorium
Feature activation+1.267
on
Token on
Feature activation+1.140
federal
Token federal
Feature activation+1.033
regulations
Token regulations
Feature activation+1.192

INTERVAL 0.926 - 1.081
CONTAINS 0.004%

âĢ
TokenâĢ
Feature activation+1.087
Ŀ
TokenĿ
Feature activation+0.993
Mr
Token Mr
Feature activation+0.929
.
Token.
Feature activation+1.005
Hil
Token Hil
Feature activation+1.085
man
Tokenman
Feature activation+0.943
said
Token said
Feature activation+1.085
,
Token,
Feature activation+0.955
âĢ
Token âĢ
Feature activation+0.842
ľ
Tokenľ
Feature activation+0.750
and
Tokenand
Feature activation+0.852
âĢ
TokenâĢ
Feature activation+1.074
ľ
Tokenľ
Feature activation+0.766
The
TokenThe
Feature activation+1.009
situation
Token situation
Feature activation+1.062
has
Token has
Feature activation+1.201
changed
Token changed
Feature activation+1.024
significantly
Token significantly
Feature activation+0.941
in
Token in
Feature activation+0.895
recent
Token recent
Feature activation+1.112
years
Token years
Feature activation+0.861
,
Token,
Feature activation+0.887
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.083
Ċ
TokenĊ
Feature activation+0.295
âĢ
TokenâĢ
Feature activation+1.074
ľ
Tokenľ
Feature activation+0.766
The
TokenThe
Feature activation+1.009
situation
Token situation
Feature activation+1.062
has
Token has
Feature activation+1.201
changed
Token changed
Feature activation+1.024
significantly
Token significantly
Feature activation+0.941
in
Token in
Feature activation+0.895
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.158
âĢ
TokenâĢ
Feature activation+0.963
ľ
Tokenľ
Feature activation+0.638
If
TokenIf
Feature activation+0.932
they
Token they
Feature activation+0.995
âĢ
TokenâĢ
Feature activation+0.963
Ļ
TokenĻ
Feature activation+0.717
d
Tokend
Feature activation+0.920
had
Token had
Feature activation+1.144
that
Token that
Feature activation+0.882
Mr
Token Mr
Feature activation+0.929
.
Token.
Feature activation+1.005
Hil
Token Hil
Feature activation+1.085
man
Tokenman
Feature activation+0.943
said
Token said
Feature activation+1.085
,
Token,
Feature activation+0.955
âĢ
Token âĢ
Feature activation+0.842
ľ
Tokenľ
Feature activation+0.750
and
Tokenand
Feature activation+0.852
if
Token if
Feature activation+0.873
they
Token they
Feature activation+0.892

INTERVAL 0.772 - 0.926
CONTAINS 0.003%

every
Token every
Feature activation+0.948
reason
Token reason
Feature activation+0.822
to
Token to
Feature activation+0.895
believe
Token believe
Feature activation+0.907
we
Token we
Feature activation+0.750
could
Token could
Feature activation+0.834
join
Token join
Feature activation+0.890
this
Token this
Feature activation+0.859
treaty
Token treaty
Feature activation+0.670
.
Token.
Feature activation+0.937
âĢ
TokenâĢ
Feature activation+0.801
when
Token when
Feature activation+0.976
they
Token they
Feature activation+1.155
did
Token did
Feature activation+1.092
the
Token the
Feature activation+0.956
Worcester
Token Worcester
Feature activation+0.824
inventory
Token inventory
Feature activation+0.917
,
Token,
Feature activation+1.025
âĢ
TokenâĢ
Feature activation+1.087
Ŀ
TokenĿ
Feature activation+0.993
Mr
Token Mr
Feature activation+0.929
.
Token.
Feature activation+1.005
has
Token has
Feature activation+0.000
occurred
Token occurred
Feature activation+0.119
<|endoftext|>
Token<|endoftext|>
Feature activation+0.453
We
TokenWe
Feature activation+0.461
previously
Token previously
Feature activation+0.554
reported
Token reported
Feature activation+0.901
a
Token a
Feature activation+0.589
growing
Token growing
Feature activation+0.710
trend
Token trend
Feature activation+0.690
that
Token that
Feature activation+0.602
Yes
Token Yes
Feature activation+0.693
could
Token could
Feature activation+0.834
join
Token join
Feature activation+0.890
this
Token this
Feature activation+0.859
treaty
Token treaty
Feature activation+0.670
.
Token.
Feature activation+0.937
âĢ
TokenâĢ
Feature activation+0.801
Ŀ
TokenĿ
Feature activation+0.659
Ċ
TokenĊ
Feature activation+0.674
Ċ
TokenĊ
Feature activation+0.677
Next
TokenNext
Feature activation+1.027
week
Token week
Feature activation+0.933
said
Token said
Feature activation+1.085
,
Token,
Feature activation+0.955
âĢ
Token âĢ
Feature activation+0.842
ľ
Tokenľ
Feature activation+0.750
and
Tokenand
Feature activation+0.852
if
Token if
Feature activation+0.873
they
Token they
Feature activation+0.892
had
Token had
Feature activation+0.879
noticed
Token noticed
Feature activation+0.674
small
Token small
Feature activation+0.609
holes
Token holes
Feature activation+0.677

INTERVAL 0.618 - 0.772
CONTAINS 0.004%

this
Token this
Feature activation+0.859
treaty
Token treaty
Feature activation+0.670
.
Token.
Feature activation+0.937
âĢ
TokenâĢ
Feature activation+0.801
Ŀ
TokenĿ
Feature activation+0.659
Ċ
TokenĊ
Feature activation+0.674
Ċ
TokenĊ
Feature activation+0.677
Next
TokenNext
Feature activation+1.027
week
Token week
Feature activation+0.933
,
Token,
Feature activation+0.744
Senator
Token Senator
Feature activation+0.638
Yes
Token Yes
Feature activation+0.693
was
Token was
Feature activation+0.724
coming
Token coming
Feature activation+0.701
out
Token out
Feature activation+0.631
on
Token on
Feature activation+0.630
top
Token top
Feature activation+0.665
in
Token in
Feature activation+0.640
every
Token every
Feature activation+0.714
debate
Token debate
Feature activation+0.806
on
Token on
Feature activation+0.713
Scotland
Token Scotland
Feature activation+0.424
Next
TokenNext
Feature activation+1.027
week
Token week
Feature activation+0.933
,
Token,
Feature activation+0.744
Senator
Token Senator
Feature activation+0.638
Leah
Token Leah
Feature activation+0.707
y
Tokeny
Feature activation+0.699
plans
Token plans
Feature activation+0.782
to
Token to
Feature activation+0.753
send
Token send
Feature activation+0.804
a
Token a
Feature activation+0.663
letter
Token letter
Feature activation+0.690
on
Token on
Feature activation+0.630
top
Token top
Feature activation+0.665
in
Token in
Feature activation+0.640
every
Token every
Feature activation+0.714
debate
Token debate
Feature activation+0.806
on
Token on
Feature activation+0.713
Scotland
Token Scotland
Feature activation+0.424
âĢ
TokenâĢ
Feature activation+0.487
Ļ
TokenĻ
Feature activation+0.579
s
Tokens
Feature activation+0.698
future
Token future
Feature activation+0.605
was
Token was
Feature activation+0.724
coming
Token coming
Feature activation+0.701
out
Token out
Feature activation+0.631
on
Token on
Feature activation+0.630
top
Token top
Feature activation+0.665
in
Token in
Feature activation+0.640
every
Token every
Feature activation+0.714
debate
Token debate
Feature activation+0.806
on
Token on
Feature activation+0.713
Scotland
Token Scotland
Feature activation+0.424
âĢ
TokenâĢ
Feature activation+0.487

INTERVAL 0.463 - 0.618
CONTAINS 0.005%

occurred
Token occurred
Feature activation+0.119
<|endoftext|>
Token<|endoftext|>
Feature activation+0.453
We
TokenWe
Feature activation+0.461
previously
Token previously
Feature activation+0.554
reported
Token reported
Feature activation+0.901
a
Token a
Feature activation+0.589
growing
Token growing
Feature activation+0.710
trend
Token trend
Feature activation+0.690
that
Token that
Feature activation+0.602
Yes
Token Yes
Feature activation+0.693
was
Token was
Feature activation+0.724
and
Tokenand
Feature activation+0.852
if
Token if
Feature activation+0.873
they
Token they
Feature activation+0.892
had
Token had
Feature activation+0.879
noticed
Token noticed
Feature activation+0.674
small
Token small
Feature activation+0.609
holes
Token holes
Feature activation+0.677
and
Token and
Feature activation+0.525
saw
Token saw
Feature activation+0.594
dust
Tokendust
Feature activation+0.611
piles
Token piles
Feature activation+0.582
Ċ
TokenĊ
Feature activation+0.429
Ċ
TokenĊ
Feature activation+0.416
With
TokenWith
Feature activation+0.859
the
Token the
Feature activation+0.654
software
Token software
Feature activation+0.783
improving
Token improving
Feature activation+0.494
,
Token,
Feature activation+0.488
cities
Token cities
Feature activation+0.563
throughout
Token throughout
Feature activation+0.537
the
Token the
Feature activation+0.557
country
Token country
Feature activation+0.501
in
Token in
Feature activation+0.491
our
Token our
Feature activation+0.390
article
Token article
Feature activation+0.422
âĢ
Token âĢ
Feature activation+0.188
ľ
Tokenľ
Feature activation+0.377
Yes
TokenYes
Feature activation+0.591
winning
Token winning
Feature activation+0.503
63
Token 63
Feature activation+0.391
%
Token%
Feature activation+0.587
to
Token to
Feature activation+0.572
33
Token 33
Feature activation+0.464
error
Token error
Feature activation+0.000
has
Token has
Feature activation+0.000
occurred
Token occurred
Feature activation+0.119
<|endoftext|>
Token<|endoftext|>
Feature activation+0.453
We
TokenWe
Feature activation+0.461
previously
Token previously
Feature activation+0.554
reported
Token reported
Feature activation+0.901
a
Token a
Feature activation+0.589
growing
Token growing
Feature activation+0.710
trend
Token trend
Feature activation+0.690
that
Token that
Feature activation+0.602

INTERVAL 0.309 - 0.463
CONTAINS 0.005%

ke
Tokenke
Feature activation+0.282
spec
Token spec
Feature activation+0.519
ulates
Tokenulates
Feature activation+0.079
that
Token that
Feature activation+0.305
they
Token they
Feature activation+0.410
were
Token were
Feature activation+0.378
exploring
Token exploring
Feature activation+0.198
for
Token for
Feature activation+0.287
even
Token even
Feature activation+0.266
higher
Token higher
Feature activation+0.085
alcohol
Token alcohol
Feature activation+0.078
,
Token,
Feature activation+0.488
cities
Token cities
Feature activation+0.563
throughout
Token throughout
Feature activation+0.537
the
Token the
Feature activation+0.557
country
Token country
Feature activation+0.501
look
Token look
Feature activation+0.370
to
Token to
Feature activation+0.540
tree
Token tree
Feature activation+0.332
invent
Token invent
Feature activation+0.470
ories
Tokenories
Feature activation+0.172
as
Token as
Feature activation+0.258
to
Token to
Feature activation+0.396
demonstrate
Token demonstrate
Feature activation+0.333
that
Token that
Feature activation+0.443
people
Token people
Feature activation+0.367
move
Token move
Feature activation+0.361
towards
Token towards
Feature activation+0.423
voting
Token voting
Feature activation+0.303
Yes
Token Yes
Feature activation+0.225
after
Token after
Feature activation+0.258
engaging
Token engaging
Feature activation+0.061
with
Token with
Feature activation+0.168
other
Token other
Feature activation+0.307
side
Token side
Feature activation+0.154
after
Token after
Feature activation+0.207
24
Token 24
Feature activation+0.078
hours
Token hours
Feature activation+0.411
.
Token.
Feature activation+0.376
Many
Token Many
Feature activation+0.485
infected
Token infected
Feature activation+0.088
larvae
Token larvae
Feature activation+0.075
started
Token started
Feature activation+0.410
moving
Token moving
Feature activation+0.129
.
Token.
Feature activation+0.000
More
Token More
Feature activation+0.000
newsletters
Token newsletters
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.438
court
Token court
Feature activation+0.491
documents
Token documents
Feature activation+0.550
also
Token also
Feature activation+0.604
said
Token said
Feature activation+0.626
Taylor
Token Taylor
Feature activation+0.427

INTERVAL 0.154 - 0.309
CONTAINS 0.005%

40
Token 40
Feature activation+0.408
percent
Token percent
Feature activation+0.616
of
Token of
Feature activation+0.329
the
Token the
Feature activation+0.385
healthy
Token healthy
Feature activation+0.263
flies
Token flies
Feature activation+0.157
crawled
Token crawled
Feature activation+0.279
to
Token to
Feature activation+0.338
the
Token the
Feature activation+0.313
other
Token other
Feature activation+0.307
side
Token side
Feature activation+0.154
that
Token that
Feature activation+0.305
they
Token they
Feature activation+0.410
were
Token were
Feature activation+0.378
exploring
Token exploring
Feature activation+0.198
for
Token for
Feature activation+0.287
even
Token even
Feature activation+0.266
higher
Token higher
Feature activation+0.085
alcohol
Token alcohol
Feature activation+0.078
concentrations
Token concentrations
Feature activation+0.024
that
Token that
Feature activation+0.106
would
Token would
Feature activation+0.104
well
Token well
Feature activation+0.000
,
Token,
Feature activation+0.226
but
Token but
Feature activation+0.305
then
Token then
Feature activation+0.277
returned
Token returned
Feature activation+0.162
to
Token to
Feature activation+0.194
the
Token the
Feature activation+0.146
alcohol
Token alcohol
Feature activation+0.000
.
Token.
Feature activation+0.331
Dr
Token Dr
Feature activation+0.153
.
Token.
Feature activation+0.359
to
Token to
Feature activation+0.572
33
Token 33
Feature activation+0.464
%
Token%
Feature activation+0.590
after
Token after
Feature activation+0.437
three
Token three
Feature activation+0.363
post
Token post
Feature activation+0.277
debate
Token debate
Feature activation+0.329
polls
Token polls
Feature activation+0.261
âĢ
TokenâĢ
Feature activation+0.235
Ŀ
TokenĿ
Feature activation+0.277
Ċ
TokenĊ
Feature activation+0.057
s
Tokens
Feature activation+0.698
future
Token future
Feature activation+0.605
in
Token in
Feature activation+0.491
our
Token our
Feature activation+0.390
article
Token article
Feature activation+0.422
âĢ
Token âĢ
Feature activation+0.188
ľ
Tokenľ
Feature activation+0.377
Yes
TokenYes
Feature activation+0.591
winning
Token winning
Feature activation+0.503
63
Token 63
Feature activation+0.391
%
Token%
Feature activation+0.587

INTERVAL 0.000 - 0.154
CONTAINS 99.970%

economy
Token economy
Feature activation+0.000
and
Token and
Feature activation+0.000
more
Token more
Feature activation+0.000
jobs
Token jobs
Feature activation+0.000
'
Token'
Feature activation+0.000
are
Token are
Feature activation+0.000
a
Token a
Feature activation+0.000
stronger
Token stronger
Feature activation+0.000
deterrent
Token deterrent
Feature activation+0.000
of
Token of
Feature activation+0.000
violent
Token violent
Feature activation+0.000
recommend
Token recommend
Feature activation+0.000
it
Token it
Feature activation+0.000
to
Token to
Feature activation+0.000
everyone
Token everyone
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
R
TokenR
Feature activation+0.000
ising
Tokenising
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
House
Token House
Feature activation+0.000
named
Token named
Feature activation+0.000
Jo
Token Jo
Feature activation+0.000
ost
Tokenost
Feature activation+0.000
El
Token El
Feature activation+0.000
ff
Tokenff
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
.[
Token.[
Feature activation+0.000
4
Token4
Feature activation+0.000
][
Token][
Feature activation+0.000
8
Token8
Feature activation+0.000
]
Token]
Feature activation+0.000
makes
Token makes
Feature activation+0.000
it
Token it
Feature activation+0.000
harder
Token harder
Feature activation+0.000
for
Token for
Feature activation+0.000
water
Token water
Feature activation+0.000
molecules
Token molecules
Feature activation+0.000
to
Token to
Feature activation+0.000
bond
Token bond
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
ice
Token ice
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
P
TokenP
Feature activation+0.000
ractical
Tokenractical
Feature activation+0.000
how
Token how
Feature activation+0.000
-
Token-
Feature activation+0.000
to
Tokento
Feature activation+0.000
steps
Token steps
Feature activation+0.000
to
Token to
Feature activation+0.000
protect
Token protect
Feature activation+0.000
yourself
Token yourself
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 7 in H1.8: (feature 10160

TOP ACTIVATIONS
MAX = 0.986

just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
ãģŁ
TokenãģŁ
Feature activation+0.000
ãĢĤ
TokenãĢĤ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
When
TokenWhen
Feature activation+0.563
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
When
TokenWhen
Feature activation+0.563
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
ãģĹ
TokenãģĹ
Feature activation+0.000
ãģŁ
TokenãģŁ
Feature activation+0.000
ãĢĤ
TokenãĢĤ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
When
TokenWhen
Feature activation+0.563
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
,
Token,
Feature activation+0.160
I
Token I
Feature activation+0.392
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
,
Token,
Feature activation+0.160
I
Token I
Feature activation+0.392
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
m
Tokenm
Feature activation+0.038
graduating
Token graduating
Feature activation+0.325
.
Token.
Feature activation+0.086
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
,
Token,
Feature activation+0.160
I
Token I
Feature activation+0.392
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
,
Token,
Feature activation+0.160
I
Token I
Feature activation+0.392
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000

Top DFA by src position
MAX = 0.396

ve
Tokenve
Feature activation+0.025
Top resid features:
just
Token just
Feature activation+0.074
Top resid features:
made
Token made
Feature activation+0.325
Top resid features:
my
Token my
Feature activation+0.136
Top resid features:
announcement
Token announcement
Feature activation+0.290
Top resid features:
,
Token,
Feature activation+0.338
Top resid features:
it
Token it
Feature activation+0.298
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ļ
TokenĻ
Feature activation+0.000
Top resid features:
s
Tokens
Feature activation+0.000
Top resid features:
hard
Token hard
Feature activation+0.000
Top resid features:
ãģĹ
TokenãģĹ
Feature activation+0.146
Top resid features:
ãģŁ
TokenãģŁ
Feature activation+0.093
Top resid features:
ãĢĤ
TokenãĢĤ
Feature activation+0.043
Top resid features:
Ċ
TokenĊ
Feature activation+0.257
Top resid features:
Ċ
TokenĊ
Feature activation+0.209
Top resid features:
When
TokenWhen
Feature activation+0.336
Top resid features:
I
Token I
Feature activation+0.203
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ļ
TokenĻ
Feature activation+0.000
Top resid features:
ve
Tokenve
Feature activation+0.000
Top resid features:
just
Token just
Feature activation+0.000
Top resid features:
it
Token it
Feature activation+0.115
Top resid features:
âĢ
TokenâĢ
Feature activation+0.009
Top resid features:
Ļ
TokenĻ
Feature activation+0.079
Top resid features:
s
Tokens
Feature activation+0.088
Top resid features:
hard
Token hard
Feature activation+0.165
Top resid features:
to
Token to
Feature activation+0.322
Top resid features:
think
Token think
Feature activation+0.181
Top resid features:
about
Token about
Feature activation+0.000
Top resid features:
what
Token what
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
talking
Token talking
Feature activation+0.000
Top resid features:
ve
Tokenve
Feature activation+0.024
Top resid features:
just
Token just
Feature activation+0.108
Top resid features:
made
Token made
Feature activation+0.238
Top resid features:
my
Token my
Feature activation+0.119
Top resid features:
announcement
Token announcement
Feature activation+0.275
Top resid features:
,
Token,
Feature activation+0.396
Top resid features:
it
Token it
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ļ
TokenĻ
Feature activation+0.000
Top resid features:
s
Tokens
Feature activation+0.000
Top resid features:
hard
Token hard
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.206
Top resid features:
it
Token it
Feature activation+0.124
Top resid features:
âĢ
TokenâĢ
Feature activation+0.008
Top resid features:
Ļ
TokenĻ
Feature activation+0.070
Top resid features:
s
Tokens
Feature activation+0.117
Top resid features:
hard
Token hard
Feature activation+0.314
Top resid features:
to
Token to
Feature activation+0.227
Top resid features:
think
Token think
Feature activation+0.000
Top resid features:
about
Token about
Feature activation+0.000
Top resid features:
what
Token what
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
ãģĹ
TokenãģĹ
Feature activation+0.129
Top resid features:
ãģŁ
TokenãģŁ
Feature activation+0.093
Top resid features:
ãĢĤ
TokenãĢĤ
Feature activation+0.045
Top resid features:
Ċ
TokenĊ
Feature activation+0.206
Top resid features:
Ċ
TokenĊ
Feature activation+0.170
Top resid features:
When
TokenWhen
Feature activation+0.316
Top resid features:
I
Token I
Feature activation+0.107
Top resid features:
âĢ
TokenâĢ
Feature activation+0.068
Top resid features:
Ļ
TokenĻ
Feature activation+0.134
Top resid features:
ve
Tokenve
Feature activation-0.006
Top resid features:
just
Token just
Feature activation+0.201
Top resid features:
I
Token I
Feature activation+0.140
Top resid features:
âĢ
TokenâĢ
Feature activation+0.083
Top resid features:
Ļ
TokenĻ
Feature activation+0.112
Top resid features:
ve
Tokenve
Feature activation+0.020
Top resid features:
just
Token just
Feature activation+0.089
Top resid features:
made
Token made
Feature activation+0.393
Top resid features:
my
Token my
Feature activation+0.180
Top resid features:
announcement
Token announcement
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
it
Token it
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
ģ
Tokenģ
Feature activation+0.150
Top resid features:
ãģ¾
Tokenãģ¾
Feature activation+0.130
Top resid features:
ãģĹ
TokenãģĹ
Feature activation+0.140
Top resid features:
ãģŁ
TokenãģŁ
Feature activation+0.084
Top resid features:
ãĢĤ
TokenãĢĤ
Feature activation+0.031
Top resid features:
Ċ
TokenĊ
Feature activation+0.284
Top resid features:
Ċ
TokenĊ
Feature activation+0.224
Top resid features:
When
TokenWhen
Feature activation+0.211
Top resid features:
I
Token I
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ļ
TokenĻ
Feature activation+0.000
Top resid features:
I
Token I
Feature activation+0.096
Top resid features:
âĢ
TokenâĢ
Feature activation+0.059
Top resid features:
Ļ
TokenĻ
Feature activation+0.122
Top resid features:
ve
Tokenve
Feature activation+0.018
Top resid features:
just
Token just
Feature activation+0.124
Top resid features:
made
Token made
Feature activation+0.281
Top resid features:
my
Token my
Feature activation+0.000
Top resid features:
announcement
Token announcement
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
it
Token it
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
I
Token I
Feature activation+0.061
Top resid features:
âĢ
TokenâĢ
Feature activation+0.065
Top resid features:
Ļ
TokenĻ
Feature activation+0.108
Top resid features:
ve
Tokenve
Feature activation+0.032
Top resid features:
just
Token just
Feature activation+0.070
Top resid features:
made
Token made
Feature activation+0.381
Top resid features:
my
Token my
Feature activation+0.123
Top resid features:
announcement
Token announcement
Feature activation+0.334
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
it
Token it
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
s
Tokens
Feature activation+0.092
Top resid features:
hard
Token hard
Feature activation+0.079
Top resid features:
to
Token to
Feature activation+0.224
Top resid features:
think
Token think
Feature activation+0.227
Top resid features:
about
Token about
Feature activation+0.173
Top resid features:
what
Token what
Feature activation+0.234
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
talking
Token talking
Feature activation+0.000
Top resid features:
about
Token about
Feature activation+0.000
Top resid features:
next
Token next
Feature activation+0.000
Top resid features:
â̦.
Tokenâ̦.
Feature activation+0.000
Top resid features:
what
Token what
Feature activation+0.125
Top resid features:
to
Token to
Feature activation+0.095
Top resid features:
talking
Token talking
Feature activation+0.065
Top resid features:
about
Token about
Feature activation+0.130
Top resid features:
next
Token next
Feature activation+0.073
Top resid features:
â̦.
Tokenâ̦.
Feature activation+0.277
Top resid features:
but
Token but
Feature activation+0.000
Top resid features:
yes
Token yes
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
I
Token I
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation-0.004
Top resid features:
Ļ
TokenĻ
Feature activation+0.064
Top resid features:
s
Tokens
Feature activation+0.087
Top resid features:
hard
Token hard
Feature activation+0.129
Top resid features:
to
Token to
Feature activation+0.211
Top resid features:
think
Token think
Feature activation+0.263
Top resid features:
about
Token about
Feature activation+0.178
Top resid features:
what
Token what
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.000
Top resid features:
talking
Token talking
Feature activation+0.000
Top resid features:
about
Token about
Feature activation+0.000
Top resid features:
s
Tokens
Feature activation+0.063
Top resid features:
hard
Token hard
Feature activation+0.083
Top resid features:
to
Token to
Feature activation+0.162
Top resid features:
think
Token think
Feature activation+0.113
Top resid features:
about
Token about
Feature activation+0.136
Top resid features:
what
Token what
Feature activation+0.189
Top resid features:
to
Token to
Feature activation+0.175
Top resid features:
talking
Token talking
Feature activation+0.187
Top resid features:
about
Token about
Feature activation+0.000
Top resid features:
next
Token next
Feature activation+0.000
Top resid features:
â̦.
Tokenâ̦.
Feature activation+0.000
Top resid features:
next
Token next
Feature activation+0.055
Top resid features:
â̦.
Tokenâ̦.
Feature activation+0.156
Top resid features:
but
Token but
Feature activation+0.154
Top resid features:
yes
Token yes
Feature activation-0.018
Top resid features:
,
Token,
Feature activation+0.199
Top resid features:
I
Token I
Feature activation+0.206
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ļ
TokenĻ
Feature activation+0.000
Top resid features:
m
Tokenm
Feature activation+0.000
Top resid features:
graduating
Token graduating
Feature activation+0.000
Top resid features:
.
Token.
Feature activation+0.000
Top resid features:
think
Token think
Feature activation+0.101
Top resid features:
about
Token about
Feature activation+0.130
Top resid features:
what
Token what
Feature activation+0.155
Top resid features:
to
Token to
Feature activation+0.159
Top resid features:
talking
Token talking
Feature activation+0.109
Top resid features:
about
Token about
Feature activation+0.160
Top resid features:
next
Token next
Feature activation+0.073
Top resid features:
â̦.
Tokenâ̦.
Feature activation+0.000
Top resid features:
but
Token but
Feature activation+0.000
Top resid features:
yes
Token yes
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
ve
Tokenve
Feature activation+0.015
Top resid features:
just
Token just
Feature activation+0.064
Top resid features:
made
Token made
Feature activation+0.166
Top resid features:
my
Token my
Feature activation+0.093
Top resid features:
announcement
Token announcement
Feature activation+0.206
Top resid features:
,
Token,
Feature activation+0.260
Top resid features:
it
Token it
Feature activation+0.128
Top resid features:
âĢ
TokenâĢ
Feature activation-0.033
Top resid features:
Ļ
TokenĻ
Feature activation+0.086
Top resid features:
s
Tokens
Feature activation+0.123
Top resid features:
hard
Token hard
Feature activation+0.000
Top resid features:
ve
Tokenve
Feature activation+0.008
Top resid features:
just
Token just
Feature activation+0.049
Top resid features:
made
Token made
Feature activation+0.162
Top resid features:
my
Token my
Feature activation+0.083
Top resid features:
announcement
Token announcement
Feature activation+0.240
Top resid features:
,
Token,
Feature activation+0.259
Top resid features:
it
Token it
Feature activation+0.101
Top resid features:
âĢ
TokenâĢ
Feature activation+0.026
Top resid features:
Ļ
TokenĻ
Feature activation+0.097
Top resid features:
s
Tokens
Feature activation+0.094
Top resid features:
hard
Token hard
Feature activation+0.197
Top resid features:
s
Tokens
Feature activation+0.088
Top resid features:
hard
Token hard
Feature activation+0.087
Top resid features:
to
Token to
Feature activation+0.182
Top resid features:
think
Token think
Feature activation+0.159
Top resid features:
about
Token about
Feature activation+0.152
Top resid features:
what
Token what
Feature activation+0.212
Top resid features:
to
Token to
Feature activation+0.188
Top resid features:
talking
Token talking
Feature activation+0.000
Top resid features:
about
Token about
Feature activation+0.000
Top resid features:
next
Token next
Feature activation+0.000
Top resid features:
â̦.
Tokenâ̦.
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.102
Top resid features:
talking
Token talking
Feature activation+0.065
Top resid features:
about
Token about
Feature activation+0.134
Top resid features:
next
Token next
Feature activation+0.075
Top resid features:
â̦.
Tokenâ̦.
Feature activation+0.104
Top resid features:
but
Token but
Feature activation+0.201
Top resid features:
yes
Token yes
Feature activation+0.000
Top resid features:
,
Token,
Feature activation+0.000
Top resid features:
I
Token I
Feature activation+0.000
Top resid features:
âĢ
TokenâĢ
Feature activation+0.000
Top resid features:
Ļ
TokenĻ
Feature activation+0.000
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.07

Head 2: 0.07

Head 3: 0.03

Head 4: 0.07

Head 5: 0.05

Head 6: 0.08

Head 7: 0.17

Head 8: 0.15

Head 9: 0.10

Head 10: 0.08

Head 11: 0.07

Positive logits

utical1.78

sama1.77

ğ1.74

1.70

izu1.70

translation1.65

aeda1.63

ł1.63

ドラゴン1.62

1.61

1.60

uese1.60

aterasu1.59

hai1.59

osher1.58

ailand1.58

ティ1.58

1.57

amina1.55

1.54

Negative logits

Racer-1.77

McM-1.56

AOL-1.49

Hampton-1.48

Ghostbusters-1.46

Breakfast-1.44

McGr-1.44

Mitch-1.41

Harvey-1.40

NFL-1.38

Mayhem-1.36

Omaha-1.36

Craw-1.35

McInt-1.35

Ford-1.33

EAR-1.30

glers-1.30

MORE-1.29

Crash-1.29

Cleveland-1.29

INTERVAL 0.887 - 0.986
CONTAINS 0.000%

just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595

INTERVAL 0.789 - 0.887
CONTAINS 0.000%

INTERVAL 0.690 - 0.789
CONTAINS 0.000%

ãģŁ
TokenãģŁ
Feature activation+0.000
ãĢĤ
TokenãĢĤ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
When
TokenWhen
Feature activation+0.563
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547

INTERVAL 0.592 - 0.690
CONTAINS 0.000%

When
TokenWhen
Feature activation+0.563
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309

INTERVAL 0.493 - 0.592
CONTAINS 0.001%

ãģĹ
TokenãģĹ
Feature activation+0.000
ãģŁ
TokenãģŁ
Feature activation+0.000
ãĢĤ
TokenãĢĤ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
When
TokenWhen
Feature activation+0.563
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484

INTERVAL 0.394 - 0.493
CONTAINS 0.000%

Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
,
Token,
Feature activation+0.160
I
Token I
Feature activation+0.392
âĢ
TokenâĢ
Feature activation+0.000

INTERVAL 0.296 - 0.394
CONTAINS 0.001%

think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
,
Token,
Feature activation+0.160
,
Token,
Feature activation+0.600
it
Token it
Feature activation+0.979
âĢ
TokenâĢ
Feature activation+0.053
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.382
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
hard
Token hard
Feature activation+0.379
to
Token to
Feature activation+0.595
think
Token think
Feature activation+0.624
about
Token about
Feature activation+0.478
what
Token what
Feature activation+0.493
to
Token to
Feature activation+0.370
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
,
Token,
Feature activation+0.160
I
Token I
Feature activation+0.392
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
m
Tokenm
Feature activation+0.038
graduating
Token graduating
Feature activation+0.325
.
Token.
Feature activation+0.086
Ċ
TokenĊ
Feature activation+0.015
Ċ
TokenĊ
Feature activation+0.000
ãģĦ
TokenãģĦ
Feature activation+0.000
ãģ
Tokenãģ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
When
TokenWhen
Feature activation+0.563
I
Token I
Feature activation+0.741
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.342
just
Token just
Feature activation+0.593
made
Token made
Feature activation+0.547
my
Token my
Feature activation+0.582
announcement
Token announcement
Feature activation+0.542
,
Token,
Feature activation+0.600

INTERVAL 0.197 - 0.296
CONTAINS 0.001%

On
Token On
Feature activation+0.000
it
Token it
Feature activation+0.000
âĢĵ
Token âĢĵ
Feature activation+0.000
When
Token When
Feature activation+0.000
you
Token you
Feature activation+0.000
understand
Token understand
Feature activation+0.201
the
Token the
Feature activation+0.000
request
Token request
Feature activation+0.184
and
Token and
Feature activation+0.000
are
Token are
Feature activation+0.020
actively
Token actively
Feature activation+0.000
talking
Token talking
Feature activation+0.441
about
Token about
Feature activation+0.309
next
Token next
Feature activation+0.387
â̦.
Tokenâ̦.
Feature activation+0.484
but
Token but
Feature activation+0.355
yes
Token yes
Feature activation+0.295
,
Token,
Feature activation+0.160
I
Token I
Feature activation+0.392
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
m
Tokenm
Feature activation+0.038
team
Token team
Feature activation+0.000
had
Token had
Feature activation+0.000
her
Token her
Feature activation+0.000
release
Token release
Feature activation+0.000
the
Token the
Feature activation+0.000
passengers
Token passengers
Feature activation+0.247
and
Token and
Feature activation+0.000
then
Token then
Feature activation+0.000
surrender
Token surrender
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
someone
Token someone
Feature activation+0.112
or
Token or
Feature activation+0.000
something
Token something
Feature activation+0.000
is
Token is
Feature activation+0.017
en
Token en
Feature activation+0.000
route
Token route
Feature activation+0.269
;
Token;
Feature activation+0.000
as
Token as
Feature activation+0.000
in
Token in
Feature activation+0.000
,
Token,
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
how
Token how
Feature activation+0.025
you
Token you
Feature activation+0.072
want
Token want
Feature activation+0.145
to
Token to
Feature activation+0.130
read
Token read
Feature activation+0.295
it
Token it
Feature activation+0.199
,
Token,
Feature activation+0.185
they
Token they
Feature activation+0.050
function
Token function
Feature activation+0.143
on
Token on
Feature activation+0.100
the
Token the
Feature activation+0.101

INTERVAL 0.099 - 0.197
CONTAINS 0.002%

read
Token read
Feature activation+0.295
it
Token it
Feature activation+0.199
,
Token,
Feature activation+0.185
they
Token they
Feature activation+0.050
function
Token function
Feature activation+0.143
on
Token on
Feature activation+0.100
the
Token the
Feature activation+0.101
same
Token same
Feature activation+0.069
principle
Token principle
Feature activation+0.000
)
Token)
Feature activation+0.000
å¤
Token å¤
Feature activation+0.000
join
Token join
Feature activation+0.000
,
Token,
Feature activation+0.000
rarely
Token rarely
Feature activation+0.000
branch
Token branch
Feature activation+0.000
and
Token and
Feature activation+0.000
flow
Token flow
Feature activation+0.111
out
Token out
Feature activation+0.000
to
Token to
Feature activation+0.000
sea
Token sea
Feature activation+0.105
),
Token),
Feature activation+0.000
but
Token but
Feature activation+0.000
(
Token (
Feature activation+0.000
depending
Tokendepending
Feature activation+0.000
on
Token on
Feature activation+0.000
how
Token how
Feature activation+0.025
you
Token you
Feature activation+0.072
want
Token want
Feature activation+0.145
to
Token to
Feature activation+0.130
read
Token read
Feature activation+0.295
it
Token it
Feature activation+0.199
,
Token,
Feature activation+0.185
they
Token they
Feature activation+0.050
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢĵ
TokenâĢĵ
Feature activation+0.000
When
Token When
Feature activation+0.000
someone
Token someone
Feature activation+0.112
or
Token or
Feature activation+0.000
something
Token something
Feature activation+0.000
is
Token is
Feature activation+0.017
en
Token en
Feature activation+0.000
route
Token route
Feature activation+0.269
depending
Tokendepending
Feature activation+0.000
on
Token on
Feature activation+0.000
how
Token how
Feature activation+0.025
you
Token you
Feature activation+0.072
want
Token want
Feature activation+0.145
to
Token to
Feature activation+0.130
read
Token read
Feature activation+0.295
it
Token it
Feature activation+0.199
,
Token,
Feature activation+0.185
they
Token they
Feature activation+0.050
function
Token function
Feature activation+0.143

INTERVAL 0.000 - 0.099
CONTAINS 99.995%

ľ
Tokenľ
Feature activation+0.000
invest
Tokeninvest
Feature activation+0.000
igate
Tokenigate
Feature activation+0.000
and
Token and
Feature activation+0.000
deal
Token deal
Feature activation+0.000
with
Token with
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Rebel
Token Rebel
Feature activation+0.000
had
Token had
Feature activation+0.000
been
Token been
Feature activation+0.000
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
French
Token French
Feature activation+0.000
woman
Token woman
Feature activation+0.000
and
Token and
Feature activation+0.000
an
Token an
Feature activation+0.000
Alger
Token Alger
Feature activation+0.000
ian
Tokenian
Feature activation+0.000
man
Token man
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
major
Token major
Feature activation+0.000
police
Token police
Feature activation+0.000
departments
Token departments
Feature activation+0.000
,
Token,
Feature activation+0.000
in
Token in
Feature activation+0.000
Texas
Token Texas
Feature activation+0.000
,
Token,
Feature activation+0.000
Florida
Token Florida
Feature activation+0.000
and
Token and
Feature activation+0.000
California
Token California
Feature activation+0.000
.
Token.
Feature activation+0.000
a
Token a
Feature activation+0.000
Race
Token Race
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
Top
Token Top
Feature activation+0.000
fan
Token fan
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Barn
TokenBarn
Feature activation+0.000
es
Tokenes
Feature activation+0.000
huge
Token huge
Feature activation+0.000
disaster
Token disaster
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
United
Token United
Feature activation+0.000
States
Token States
Feature activation+0.000
has
Token has
Feature activation+0.000
shame
Token shame
Feature activation+0.000
fully
Tokenfully
Feature activation+0.000
high
Token high
Feature activation+0.000
poverty
Token poverty
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 8 in H1.8: (feature 1773

TOP ACTIVATIONS
MAX = 0.031

)
Token)
Feature activation+0.000
(
Token (
Feature activation+0.000
H
TokenH
Feature activation+0.000
ind
Tokenind
Feature activation+0.000
i
Tokeni
Feature activation+0.000
:
Token:
Feature activation+0.031
à¤
Token à¤
Feature activation+0.000
¦
Token¦
Feature activation+0.000
à¤
Tokenà¤
Feature activation+0.000
¸
Token¸
Feature activation+0.000
à¥
Tokenà¥
Feature activation+0.000
f
Tokenf
Feature activation+0.000
max
Tokenmax
Feature activation+0.000
.
Token.
Feature activation+0.000
jpg
Tokenjpg
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Yesterday
TokenYesterday
Feature activation+0.001
I
Token I
Feature activation+0.000
shared
Token shared
Feature activation+0.000
my
Token my
Feature activation+0.000
thoughts
Token thoughts
Feature activation+0.000
about
Token about
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000

Top DFA by src position
MAX = 1.036

<|endoftext|>
Token<|endoftext|>
Feature activation+0.427
Top resid features:
failed
Token failed
Feature activation+0.000
Top resid features:
to
Token to
Feature activation+0.016
Top resid features:
win
Token win
Feature activation-0.024
Top resid features:
a
Token a
Feature activation+0.017
Top resid features:
plurality
Token plurality
Feature activation-0.012
Top resid features:
_
Token_
Feature activation+0.019
Top resid features:
f
Tokenf
Feature activation+0.023
Top resid features:
max
Tokenmax
Feature activation+0.007
Top resid features:
.
Token.
Feature activation+0.035
Top resid features:
jpg
Tokenjpg
Feature activation+0.047
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.678
Top resid features:
Yesterday
TokenYesterday
Feature activation+0.532
Top resid features:
I
Token I
Feature activation+0.000
Top resid features:
shared
Token shared
Feature activation+0.000
Top resid features:
my
Token my
Feature activation+0.000
Top resid features:
thoughts
Token thoughts
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.562
Top resid features:
ile
Tokenile
Feature activation+0.095
Top resid features:
(
Token (
Feature activation+0.049
Top resid features:
Michael
TokenMichael
Feature activation+0.017
Top resid features:
C
Token C
Feature activation+0.010
Top resid features:
aine
Tokenaine
Feature activation+0.055
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.635
Top resid features:
ile
Tokenile
Feature activation+0.051
Top resid features:
(
Token (
Feature activation+0.046
Top resid features:
Michael
TokenMichael
Feature activation+0.012
Top resid features:
C
Token C
Feature activation+0.013
Top resid features:
aine
Tokenaine
Feature activation+0.059
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+1.036
Top resid features:
ile
Tokenile
Feature activation+0.044
Top resid features:
(
Token (
Feature activation+0.025
Top resid features:
Michael
TokenMichael
Feature activation+0.077
Top resid features:
C
Token C
Feature activation+0.310
Top resid features:
aine
Tokenaine
Feature activation-0.217
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.866
Top resid features:
ile
Tokenile
Feature activation+0.082
Top resid features:
(
Token (
Feature activation+0.072
Top resid features:
Michael
TokenMichael
Feature activation+0.014
Top resid features:
C
Token C
Feature activation+0.052
Top resid features:
aine
Tokenaine
Feature activation+0.005
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.655
Top resid features:
ile
Tokenile
Feature activation+0.101
Top resid features:
(
Token (
Feature activation+0.039
Top resid features:
Michael
TokenMichael
Feature activation+0.009
Top resid features:
C
Token C
Feature activation+0.007
Top resid features:
aine
Tokenaine
Feature activation+0.045
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.587
Top resid features:
ile
Tokenile
Feature activation+0.058
Top resid features:
(
Token (
Feature activation+0.043
Top resid features:
Michael
TokenMichael
Feature activation+0.011
Top resid features:
C
Token C
Feature activation+0.010
Top resid features:
aine
Tokenaine
Feature activation+0.054
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.587
Top resid features:
ile
Tokenile
Feature activation+0.113
Top resid features:
(
Token (
Feature activation+0.052
Top resid features:
Michael
TokenMichael
Feature activation+0.026
Top resid features:
C
Token C
Feature activation-0.029
Top resid features:
aine
Tokenaine
Feature activation+0.089
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.647
Top resid features:
ile
Tokenile
Feature activation+0.098
Top resid features:
(
Token (
Feature activation+0.046
Top resid features:
Michael
TokenMichael
Feature activation+0.021
Top resid features:
C
Token C
Feature activation+0.013
Top resid features:
aine
Tokenaine
Feature activation+0.075
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.875
Top resid features:
ile
Tokenile
Feature activation+0.149
Top resid features:
(
Token (
Feature activation+0.047
Top resid features:
Michael
TokenMichael
Feature activation+0.043
Top resid features:
C
Token C
Feature activation+0.030
Top resid features:
aine
Tokenaine
Feature activation+0.037
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.742
Top resid features:
ile
Tokenile
Feature activation+0.144
Top resid features:
(
Token (
Feature activation+0.049
Top resid features:
Michael
TokenMichael
Feature activation+0.038
Top resid features:
C
Token C
Feature activation+0.027
Top resid features:
aine
Tokenaine
Feature activation+0.041
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.592
Top resid features:
ile
Tokenile
Feature activation+0.065
Top resid features:
(
Token (
Feature activation+0.052
Top resid features:
Michael
TokenMichael
Feature activation+0.018
Top resid features:
C
Token C
Feature activation+0.023
Top resid features:
aine
Tokenaine
Feature activation+0.101
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.706
Top resid features:
ile
Tokenile
Feature activation+0.088
Top resid features:
(
Token (
Feature activation+0.041
Top resid features:
Michael
TokenMichael
Feature activation+0.016
Top resid features:
C
Token C
Feature activation+0.019
Top resid features:
aine
Tokenaine
Feature activation+0.085
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.831
Top resid features:
ile
Tokenile
Feature activation+0.043
Top resid features:
(
Token (
Feature activation+0.050
Top resid features:
Michael
TokenMichael
Feature activation+0.039
Top resid features:
C
Token C
Feature activation+0.006
Top resid features:
aine
Tokenaine
Feature activation-0.014
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.888
Top resid features:
ile
Tokenile
Feature activation+0.056
Top resid features:
(
Token (
Feature activation+0.062
Top resid features:
Michael
TokenMichael
Feature activation+0.025
Top resid features:
C
Token C
Feature activation+0.010
Top resid features:
aine
Tokenaine
Feature activation+0.015
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.609
Top resid features:
ile
Tokenile
Feature activation+0.235
Top resid features:
(
Token (
Feature activation+0.040
Top resid features:
Michael
TokenMichael
Feature activation+0.009
Top resid features:
C
Token C
Feature activation+0.017
Top resid features:
aine
Tokenaine
Feature activation+0.180
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.771
Top resid features:
ile
Tokenile
Feature activation+0.089
Top resid features:
(
Token (
Feature activation+0.057
Top resid features:
Michael
TokenMichael
Feature activation+0.021
Top resid features:
C
Token C
Feature activation+0.033
Top resid features:
aine
Tokenaine
Feature activation+0.066
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.776
Top resid features:
ile
Tokenile
Feature activation+0.066
Top resid features:
(
Token (
Feature activation+0.045
Top resid features:
Michael
TokenMichael
Feature activation+0.042
Top resid features:
C
Token C
Feature activation+0.049
Top resid features:
aine
Tokenaine
Feature activation+0.020
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.867
Top resid features:
ile
Tokenile
Feature activation+0.100
Top resid features:
(
Token (
Feature activation+0.039
Top resid features:
Michael
TokenMichael
Feature activation+0.008
Top resid features:
C
Token C
Feature activation+0.000
Top resid features:
aine
Tokenaine
Feature activation+0.162
Top resid features:

Decoder Weights Distribution

Head 0: 0.06

Head 1: 0.07

Head 2: 0.06

Head 3: 0.04

Head 4: 0.08

Head 5: 0.06

Head 6: 0.07

Head 7: 0.16

Head 8: 0.15

Head 9: 0.12

Head 10: 0.08

Head 11: 0.07

Positive logits

1.80

1.79

enhagen1.72

フォ1.70

estern1.65

utical1.63

ğ1.61

zsche1.59

1.57

aughtered1.54

ール1.53

1.53

1.52

icrobial1.51

itamin1.51

ー�1.51

velength1.49

enthal1.49

ibaba1.46

ipal1.45

Negative logits

muff-1.44

Collider-1.43

Racer-1.42

Connor-1.35

NR-1.34

Creep-1.33

routed-1.27

thrott-1.25

whine-1.25

Pip-1.23

whining-1.22

controllers-1.22

Ghostbusters-1.21

remorse-1.20

1001-1.19

Controller-1.18

Crash-1.18

peripher-1.17

Klu-1.16

Mitch-1.16

INTERVAL 0.028 - 0.031
CONTAINS 0.000%

)
Token)
Feature activation+0.000
(
Token (
Feature activation+0.000
H
TokenH
Feature activation+0.000
ind
Tokenind
Feature activation+0.000
i
Tokeni
Feature activation+0.000
:
Token:
Feature activation+0.031
à¤
Token à¤
Feature activation+0.000
¦
Token¦
Feature activation+0.000
à¤
Tokenà¤
Feature activation+0.000
¸
Token¸
Feature activation+0.000
à¥
Tokenà¥
Feature activation+0.000

INTERVAL 0.025 - 0.028
CONTAINS 0.000%

INTERVAL 0.021 - 0.025
CONTAINS 0.000%

INTERVAL 0.018 - 0.021
CONTAINS 0.000%

INTERVAL 0.015 - 0.018
CONTAINS 0.000%

INTERVAL 0.012 - 0.015
CONTAINS 0.000%

INTERVAL 0.009 - 0.012
CONTAINS 0.000%

INTERVAL 0.006 - 0.009
CONTAINS 0.000%

INTERVAL 0.003 - 0.006
CONTAINS 0.000%

INTERVAL 0.000 - 0.003
CONTAINS 100.000%

gasp
Token gasp
Feature activation+0.000
of
Token of
Feature activation+0.000
winter
Token winter
Feature activation+0.000
-
Token -
Feature activation+0.000
we
Token we
Feature activation+0.000
can
Token can
Feature activation+0.000
only
Token only
Feature activation+0.000
assume
Token assume
Feature activation+0.000
it
Token it
Feature activation+0.000
's
Token's
Feature activation+0.000
going
Token going
Feature activation+0.000
upgrades
Token upgrades
Feature activation+0.000
but
Token but
Feature activation+0.000
we
Token we
Feature activation+0.000
hope
Token hope
Feature activation+0.000
the
Token the
Feature activation+0.000
upgrade
Token upgrade
Feature activation+0.000
will
Token will
Feature activation+0.000
hit
Token hit
Feature activation+0.000
other
Token other
Feature activation+0.000
Xiaomi
Token Xiaomi
Feature activation+0.000
phones
Token phones
Feature activation+0.000
to
Token to
Feature activation+0.000
measure
Token measure
Feature activation+0.000
it
Token it
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Photo
TokenPhoto
Feature activation+0.000
:
Token:
Feature activation+0.000
Phot
Token Phot
Feature activation+0.000
om
Tokenom
Feature activation+0.000
ult
Tokenult
Feature activation+0.000
"
Token"
Feature activation+0.000
and
Token and
Feature activation+0.000
"
Token "
Feature activation+0.000
support
Tokensupport
Feature activation+0.000
ing
Tokening
Feature activation+0.000
policies
Token policies
Feature activation+0.000
designed
Token designed
Feature activation+0.000
to
Token to
Feature activation+0.000
boost
Token boost
Feature activation+0.000
enterprise
Token enterprise
Feature activation+0.000
and
Token and
Feature activation+0.000
storage
Token storage
Feature activation+0.000
option
Token option
Feature activation+0.000
,
Token,
Feature activation+0.000
a
Token a
Feature activation+0.000
13
Token 13
Feature activation+0.000
MP
TokenMP
Feature activation+0.000
O
Token O
Feature activation+0.000
IS
TokenIS
Feature activation+0.000
+
Token+
Feature activation+0.000
rear
Token rear
Feature activation+0.000
-
Token-
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000

Top feature 9 in H1.8: (feature 21478

TOP ACTIVATIONS
MAX = 0.682

,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
The
Token The
Feature activation+0.241
Man
Token Man
Feature activation+0.383
with
Token with
Feature activation+0.218
the
Token the
Feature activation+0.284
Mania
TokenMania
Feature activation+0.000
XXX
Token XXX
Feature activation+0.000
.
Token.
Feature activation+0.335
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
,
Token,
Feature activation+0.437
is
Token is
Feature activation+0.183
still
Token still
Feature activation+0.296
in
Token in
Feature activation+0.401
with
Token with
Feature activation+0.080
WWE
Token WWE
Feature activation+0.000
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
,
Token,
Feature activation+0.075
with
Token with
Feature activation+0.080
WWE
Token WWE
Feature activation+0.000
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
Wrestle
Token Wrestle
Feature activation+0.000
Mania
TokenMania
Feature activation+0.000
XXX
Token XXX
Feature activation+0.000
.
Token.
Feature activation+0.335
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
,
Token,
Feature activation+0.437
is
Token is
Feature activation+0.183
still
Token still
Feature activation+0.296
in
Token in
Feature activation+0.401
the
Token the
Feature activation+0.378
very
Token very
Feature activation+0.127
WWE
Token WWE
Feature activation+0.000
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
headline
Token headline
Feature activation+0.000
Wrestle
Token Wrestle
Feature activation+0.000
Mania
TokenMania
Feature activation+0.000
XXX
Token XXX
Feature activation+0.000
.
Token.
Feature activation+0.335
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
,
Token,
Feature activation+0.437
is
Token is
Feature activation+0.183
still
Token still
Feature activation+0.296
in
Token in
Feature activation+0.401
the
Token the
Feature activation+0.378
very
Token very
Feature activation+0.127
early
Token early
Feature activation+0.228
stages
Token stages
Feature activation+0.287
of
Token of
Feature activation+0.182
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
The
Token The
Feature activation+0.241
Man
Token Man
Feature activation+0.383
with
Token with
Feature activation+0.218
the
Token the
Feature activation+0.284
Iron
Token Iron
Feature activation+0.234
F
Token F
Feature activation+0.091
ists
Tokenists
Feature activation+0.278
XXX
Token XXX
Feature activation+0.000
.
Token.
Feature activation+0.335
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170

Top DFA by src position
MAX = 0.615

<|endoftext|>
Token<|endoftext|>
Feature activation+0.478
Top resid features:
a
Token a
Feature activation+0.049
Top resid features:
dram
Token dram
Feature activation-0.007
Top resid features:
at
Tokenat
Feature activation+0.009
Top resid features:
ized
Tokenized
Feature activation+0.006
Top resid features:
biography
Token biography
Feature activation-0.031
Top resid features:
acting
Token acting
Feature activation+0.154
Top resid features:
in
Token in
Feature activation+0.176
Top resid features:
2006
Token 2006
Feature activation+0.150
Top resid features:
and
Token and
Feature activation+0.229
Top resid features:
has
Token has
Feature activation+0.306
Top resid features:
starred
Token starred
Feature activation+0.517
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
The
Token The
Feature activation+0.000
Top resid features:
Man
Token Man
Feature activation+0.000
Top resid features:
with
Token with
Feature activation+0.000
Top resid features:
the
Token the
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.520
Top resid features:
-
Token-
Feature activation+0.085
Top resid features:
signed
Tokensigned
Feature activation+0.025
Top resid features:
with
Token with
Feature activation+0.081
Top resid features:
WWE
Token WWE
Feature activation+0.198
Top resid features:
in
Token in
Feature activation+0.092
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.462
Top resid features:
a
Token a
Feature activation+0.052
Top resid features:
dram
Token dram
Feature activation-0.009
Top resid features:
at
Tokenat
Feature activation+0.001
Top resid features:
ized
Tokenized
Feature activation+0.005
Top resid features:
biography
Token biography
Feature activation-0.037
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.440
Top resid features:
a
Token a
Feature activation+0.044
Top resid features:
dram
Token dram
Feature activation-0.002
Top resid features:
at
Tokenat
Feature activation+0.008
Top resid features:
ized
Tokenized
Feature activation+0.007
Top resid features:
biography
Token biography
Feature activation-0.035
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.450
Top resid features:
a
Token a
Feature activation+0.036
Top resid features:
dram
Token dram
Feature activation-0.000
Top resid features:
at
Tokenat
Feature activation+0.004
Top resid features:
ized
Tokenized
Feature activation+0.006
Top resid features:
biography
Token biography
Feature activation-0.024
Top resid features:
Ċ
TokenĊ
Feature activation+0.235
Top resid features:
Ċ
TokenĊ
Feature activation+0.230
Top resid features:
B
TokenB
Feature activation+0.093
Top resid features:
aut
Tokenaut
Feature activation+0.090
Top resid features:
ista
Tokenista
Feature activation+0.156
Top resid features:
began
Token began
Feature activation+0.615
Top resid features:
acting
Token acting
Feature activation+0.000
Top resid features:
in
Token in
Feature activation+0.000
Top resid features:
2006
Token 2006
Feature activation+0.000
Top resid features:
and
Token and
Feature activation+0.000
Top resid features:
has
Token has
Feature activation+0.000
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.467
Top resid features:
a
Token a
Feature activation+0.048
Top resid features:
dram
Token dram
Feature activation-0.007
Top resid features:
at
Tokenat
Feature activation+0.008
Top resid features:
ized
Tokenized
Feature activation+0.008
Top resid features:
biography
Token biography
Feature activation-0.041
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.511
Top resid features:
-
Token-
Feature activation+0.073
Top resid features:
signed
Tokensigned
Feature activation+0.031
Top resid features:
with
Token with
Feature activation+0.060
Top resid features:
WWE
Token WWE
Feature activation+0.293
Top resid features:
in
Token in
Feature activation+0.069
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.439
Top resid features:
a
Token a
Feature activation+0.042
Top resid features:
dram
Token dram
Feature activation+0.002
Top resid features:
at
Tokenat
Feature activation+0.009
Top resid features:
ized
Tokenized
Feature activation+0.007
Top resid features:
biography
Token biography
Feature activation-0.035
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.422
Top resid features:
a
Token a
Feature activation+0.029
Top resid features:
dram
Token dram
Feature activation+0.002
Top resid features:
at
Tokenat
Feature activation+0.002
Top resid features:
ized
Tokenized
Feature activation+0.006
Top resid features:
biography
Token biography
Feature activation-0.029
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.435
Top resid features:
a
Token a
Feature activation+0.050
Top resid features:
dram
Token dram
Feature activation-0.004
Top resid features:
at
Tokenat
Feature activation+0.008
Top resid features:
ized
Tokenized
Feature activation+0.008
Top resid features:
biography
Token biography
Feature activation-0.035
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.523
Top resid features:
-
Token-
Feature activation+0.069
Top resid features:
signed
Tokensigned
Feature activation+0.035
Top resid features:
with
Token with
Feature activation+0.054
Top resid features:
WWE
Token WWE
Feature activation+0.314
Top resid features:
in
Token in
Feature activation+0.066
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.427
Top resid features:
a
Token a
Feature activation+0.034
Top resid features:
dram
Token dram
Feature activation-0.004
Top resid features:
at
Tokenat
Feature activation+0.004
Top resid features:
ized
Tokenized
Feature activation+0.011
Top resid features:
biography
Token biography
Feature activation-0.039
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.423
Top resid features:
a
Token a
Feature activation+0.023
Top resid features:
dram
Token dram
Feature activation+0.001
Top resid features:
at
Tokenat
Feature activation+0.000
Top resid features:
ized
Tokenized
Feature activation+0.006
Top resid features:
biography
Token biography
Feature activation-0.030
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.460
Top resid features:
a
Token a
Feature activation+0.049
Top resid features:
dram
Token dram
Feature activation+0.001
Top resid features:
at
Tokenat
Feature activation+0.008
Top resid features:
ized
Tokenized
Feature activation+0.009
Top resid features:
biography
Token biography
Feature activation-0.039
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.481
Top resid features:
-
Token-
Feature activation+0.053
Top resid features:
signed
Tokensigned
Feature activation+0.026
Top resid features:
with
Token with
Feature activation+0.064
Top resid features:
WWE
Token WWE
Feature activation+0.187
Top resid features:
in
Token in
Feature activation+0.075
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.453
Top resid features:
a
Token a
Feature activation+0.042
Top resid features:
dram
Token dram
Feature activation-0.002
Top resid features:
at
Tokenat
Feature activation+0.007
Top resid features:
ized
Tokenized
Feature activation+0.008
Top resid features:
biography
Token biography
Feature activation-0.031
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.490
Top resid features:
-
Token-
Feature activation+0.060
Top resid features:
signed
Tokensigned
Feature activation+0.014
Top resid features:
with
Token with
Feature activation+0.054
Top resid features:
WWE
Token WWE
Feature activation+0.162
Top resid features:
in
Token in
Feature activation+0.068
Top resid features:
<|endoftext|>
Token<|endoftext|>
Feature activation+0.552
Top resid features:
-
Token-
Feature activation+0.114
Top resid features:
signed
Tokensigned
Feature activation+0.035
Top resid features:
with
Token with
Feature activation+0.109
Top resid features:
WWE
Token WWE
Feature activation+0.255
Top resid features:
in
Token in
Feature activation+0.111
Top resid features:

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.06

Head 2: 0.06

Head 3: 0.04

Head 4: 0.08

Head 5: 0.07

Head 6: 0.07

Head 7: 0.17

Head 8: 0.15

Head 9: 0.11

Head 10: 0.07

Head 11: 0.07

Positive logits

Regular1.40

":["1.38

Beast1.29

1.27

Insert1.27

sequels1.20

Season1.20

DragonMagazine1.19

Characters1.17

ortal1.15

Haunted1.12

™:1.11

Modified1.11

Ghost1.10

idays1.10

xual1.08

repeat1.08

guiActiveUnfocused1.08

Spiel1.06

ado1.05

Negative logits

ndra-1.43

Shutterstock-1.29

ois-1.25

Sutton-1.22

yson-1.19

scr-1.17

nan-1.16

iane-1.15

ando-1.14

oshenko-1.13

tru-1.13

Tory-1.13

lib-1.12

iffe-1.11

barr-1.11

corpor-1.10

graphene-1.10

ricia-1.09

enclave-1.07

polic-1.07

INTERVAL 0.614 - 0.682
CONTAINS 0.000%

,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
The
Token The
Feature activation+0.241
Man
Token Man
Feature activation+0.383
with
Token with
Feature activation+0.218
the
Token the
Feature activation+0.284

INTERVAL 0.546 - 0.614
CONTAINS 0.000%

INTERVAL 0.478 - 0.546
CONTAINS 0.000%

with
Token with
Feature activation+0.080
WWE
Token WWE
Feature activation+0.000
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
,
Token,
Feature activation+0.437
is
Token is
Feature activation+0.183
still
Token still
Feature activation+0.296
in
Token in
Feature activation+0.401
Mania
TokenMania
Feature activation+0.000
XXX
Token XXX
Feature activation+0.000
.
Token.
Feature activation+0.335
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313

INTERVAL 0.409 - 0.478
CONTAINS 0.001%

Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
headline
Token headline
Feature activation+0.000
Wrestle
Token Wrestle
Feature activation+0.000
Mania
TokenMania
Feature activation+0.000
XXX
Token XXX
Feature activation+0.000
.
Token.
Feature activation+0.335
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
,
Token,
Feature activation+0.075
with
Token with
Feature activation+0.080
WWE
Token WWE
Feature activation+0.000
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509

INTERVAL 0.341 - 0.409
CONTAINS 0.001%

Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
,
Token,
Feature activation+0.437
is
Token is
Feature activation+0.183
still
Token still
Feature activation+0.296
in
Token in
Feature activation+0.401
the
Token the
Feature activation+0.378
very
Token very
Feature activation+0.127
early
Token early
Feature activation+0.228
stages
Token stages
Feature activation+0.287
of
Token of
Feature activation+0.182
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
between
Token between
Feature activation+0.000
not
Token not
Feature activation+0.094
only
Token only
Feature activation+0.055
Triple
Token Triple
Feature activation+0.000
H
Token H
Feature activation+0.000
and
Token and
Feature activation+0.369
Les
Token Les
Feature activation+0.000
nar
Tokennar
Feature activation+0.000
,
Token,
Feature activation+0.206
but
Token but
Feature activation+0.076
Brock
Token Brock
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
The
Token The
Feature activation+0.241
Man
Token Man
Feature activation+0.383
with
Token with
Feature activation+0.218
the
Token the
Feature activation+0.284
Iron
Token Iron
Feature activation+0.234
F
Token F
Feature activation+0.091
ists
Tokenists
Feature activation+0.278

INTERVAL 0.273 - 0.341
CONTAINS 0.001%

B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
still
Token still
Feature activation+0.296
in
Token in
Feature activation+0.401
the
Token the
Feature activation+0.378
very
Token very
Feature activation+0.127
early
Token early
Feature activation+0.228
stages
Token stages
Feature activation+0.287
of
Token of
Feature activation+0.182
having
Token having
Feature activation+0.241
a
Token a
Feature activation+0.321
script
Token script
Feature activation+0.000
finalized
Token finalized
Feature activation+0.000
.
Token.
Feature activation+0.335
Ċ
TokenĊ
Feature activation+0.428
Ċ
TokenĊ
Feature activation+0.441
B
TokenB
Feature activation+0.542
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
The
Token The
Feature activation+0.241
Man
Token Man
Feature activation+0.383
with
Token with
Feature activation+0.218
the
Token the
Feature activation+0.284
Iron
Token Iron
Feature activation+0.234
F
Token F
Feature activation+0.091
ists
Tokenists
Feature activation+0.278
(
Token (
Feature activation+0.078
2012
Token2012
Feature activation+0.000
Man
Token Man
Feature activation+0.383
with
Token with
Feature activation+0.218
the
Token the
Feature activation+0.284
Iron
Token Iron
Feature activation+0.234
F
Token F
Feature activation+0.091
ists
Tokenists
Feature activation+0.278
(
Token (
Feature activation+0.078
2012
Token2012
Feature activation+0.000
),
Token),
Feature activation+0.320
R
Token R
Feature activation+0.201
idd
Tokenidd
Feature activation+0.000

INTERVAL 0.205 - 0.273
CONTAINS 0.002%

2000
Token 2000
Feature activation+0.000
to
Token to
Feature activation+0.000
2014
Token 2014
Feature activation+0.000
,
Token,
Feature activation+0.000
there
Token there
Feature activation+0.000
were
Token were
Feature activation+0.238
15
Token 15
Feature activation+0.000
different
Token different
Feature activation+0.000
Heisman
Token Heisman
Feature activation+0.000
winners
Token winners
Feature activation+0.000
,
Token,
Feature activation+0.000
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
The
Token The
Feature activation+0.241
Man
Token Man
Feature activation+0.383
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
/
Token/
Feature activation+0.469
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
the
Token the
Feature activation+0.379
ridiculous
Token ridiculous
Feature activation+0.403
aw
Tokenaw
Feature activation+0.682
esome
Tokenesome
Feature activation+0.266
title
Token title
Feature activation+0.000
of
Token of
Feature activation+0.292
Pand
Token Pand
Feature activation+0.509
emonium
Tokenemonium
Feature activation+0.241
,
Token,
Feature activation+0.437
is
Token is
Feature activation+0.183
still
Token still
Feature activation+0.296
in
Token in
Feature activation+0.401
the
Token the
Feature activation+0.378

INTERVAL 0.136 - 0.205
CONTAINS 0.001%

entire
Token entire
Feature activation+0.000
WWE
Token WWE
Feature activation+0.000
infrastructure
Token infrastructure
Feature activation+0.257
.
Token.
Feature activation+0.221
I
Token I
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.172
Ļ
TokenĻ
Feature activation+0.000
m
Tokenm
Feature activation+0.000
excited
Token excited
Feature activation+0.000
to
Token to
Feature activation+0.000
see
Token see
Feature activation+0.000
aut
Tokenaut
Feature activation+0.383
ista
Tokenista
Feature activation+0.313
began
Token began
Feature activation+0.459
acting
Token acting
Feature activation+0.387
in
Token in
Feature activation+0.313
2006
Token 2006
Feature activation+0.170
and
Token and
Feature activation+0.266
has
Token has
Feature activation+0.264
starred
Token starred
Feature activation+0.634
in
Token in
Feature activation+0.319
The
Token The
Feature activation+0.241
the
Token the
Feature activation+0.000
project
Token project
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
there
Token there
Feature activation+0.000
is
Token is
Feature activation+0.184
one
Token one
Feature activation+0.053
glaring
Token glaring
Feature activation+0.037
question
Token question
Feature activation+0.000
that
Token that
Feature activation+0.000
needs
Token needs
Feature activation+0.000
F
Token F
Feature activation+0.091
ists
Tokenists
Feature activation+0.278
(
Token (
Feature activation+0.078
2012
Token2012
Feature activation+0.000
),
Token),
Feature activation+0.320
R
Token R
Feature activation+0.201
idd
Tokenidd
Feature activation+0.000
ick
Tokenick
Feature activation+0.099
(
Token (
Feature activation+0.085
2013
Token2013
Feature activation+0.000
),
Token),
Feature activation+0.212
having
Token having
Feature activation+0.241
a
Token a
Feature activation+0.321
script
Token script
Feature activation+0.000
finalized
Token finalized
Feature activation+0.000
.
Token.
Feature activation+0.291
No
Token No
Feature activation+0.154
director
Token director
Feature activation+0.000
or
Token or
Feature activation+0.000
actors
Token actors
Feature activation+0.000
are
Token are
Feature activation+0.000
attached
Token attached
Feature activation+0.000

INTERVAL 0.068 - 0.136
CONTAINS 0.002%

a
Token a
Feature activation+0.184
much
Token much
Feature activation+0.207
longer
Token longer
Feature activation+0.000
program
Token program
Feature activation+0.000
between
Token between
Feature activation+0.000
not
Token not
Feature activation+0.094
only
Token only
Feature activation+0.055
Triple
Token Triple
Feature activation+0.000
H
Token H
Feature activation+0.000
and
Token and
Feature activation+0.369
Les
Token Les
Feature activation+0.000
backing
Token backing
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
WWE
Token WWE
Feature activation+0.000
CEO
Token CEO
Feature activation+0.000
,
Token,
Feature activation+0.075
with
Token with
Feature activation+0.080
WWE
Token WWE
Feature activation+0.000
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481
help
Token help
Feature activation+0.436
with
Token with
Feature activation+0.384
production
Token production
Feature activation+0.099
.
Token.
Feature activation+0.401
The
Token The
Feature activation+0.252
movie
Token movie
Feature activation+0.071
,
Token,
Feature activation+0.440
given
Token given
Feature activation+0.233
weight
Tokenweight
Feature activation+0.000
Champion
Token Champion
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Deb
TokenDeb
Feature activation+0.106
uting
Tokenuting
Feature activation+0.000
in
Token in
Feature activation+0.000
1997
Token 1997
Feature activation+0.000
,
Token,
Feature activation+0.023
Mak
Token Mak
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
WWE
Token WWE
Feature activation+0.000
CEO
Token CEO
Feature activation+0.000
,
Token,
Feature activation+0.075
with
Token with
Feature activation+0.080
WWE
Token WWE
Feature activation+0.000
Studios
Token Studios
Feature activation+0.264
partnering
Token partnering
Feature activation+0.294
on
Token on
Feature activation+0.452
to
Token to
Feature activation+0.481

INTERVAL 0.000 - 0.068
CONTAINS 99.991%

tangible
Token tangible
Feature activation+0.000
signs
Token signs
Feature activation+0.000
of
Token of
Feature activation+0.000
optimism
Token optimism
Feature activation+0.000
.
Token.
Feature activation+0.000
After
Token After
Feature activation+0.000
being
Token being
Feature activation+0.000
eliminated
Token eliminated
Feature activation+0.000
from
Token from
Feature activation+0.000
the
Token the
Feature activation+0.000
2014
Token 2014
Feature activation+0.000
financial
Token financial
Feature activation+0.000
review
Token review
Feature activation+0.000
is
Token is
Feature activation+0.000
expected
Token expected
Feature activation+0.000
in
Token in
Feature activation+0.000
less
Token less
Feature activation+0.000
than
Token than
Feature activation+0.000
two
Token two
Feature activation+0.000
weeks
Token weeks
Feature activation+0.000
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
far
Token far
Feature activation+0.000
more
Token more
Feature activation+0.000
stability
Token stability
Feature activation+0.000
during
Token during
Feature activation+0.000
his
Token his
Feature activation+0.000
eight
Token eight
Feature activation+0.000
-
Token-
Feature activation+0.000
year
Tokenyear
Feature activation+0.000
career
Token career
Feature activation+0.000
.
Token.
Feature activation+0.000
Tom
Token Tom
Feature activation+0.000
dependency
Token dependency
Feature activation+0.000
1
Token 1
Feature activation+0.000
Tue
Token Tue
Feature activation+0.000
Sep
Token Sep
Feature activation+0.000
24
Token 24
Feature activation+0.000
03
Token 03
Feature activation+0.000
:
Token:
Feature activation+0.000
21
Token21
Feature activation+0.000
:
Token:
Feature activation+0.000
25
Token25
Feature activation+0.000
2013
Token 2013
Feature activation+0.000
the
Token the
Feature activation+0.000
film
Token film
Feature activation+0.000
arrived
Token arrived
Feature activation+0.000
in
Token in
Feature activation+0.000
theaters
Token theaters
Feature activation+0.000
.
Token.
Feature activation+0.000
However
Token However
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
was
Token was
Feature activation+0.000
never
Token never
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
bonds
Token bonds
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
sc
Token sc
Feature activation+0.000
iss
Tokeniss
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
ile
Tokenile
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
(
Token (
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Michael
TokenMichael
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
is
Token is
Feature activation+0.000
regulated
Token regulated
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
residues
Token residues
Feature activation+0.000
fl
Token fl
Feature activation+0.000
anking
Tokenanking
Feature activation+0.000
the
Token the
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
ag
Tokenag
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000
G
Token G
Feature activation+0.000
C
Token C
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
aine
Tokenaine
Feature activation+0.000
)
Token)
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
process
Token process
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
processing
Token processing
Feature activation+0.000
of
Token of
Feature activation+0.000